Gen AI

BLOG

Food Photography with Generative AI

How create hyper realistic food photography with Generative AI in 4k resolution ready to print (4,736 x 3,520 px). Step-by-step tutorial below. his is tutorial 1/20 of the Freepik’s Mystic exploration series @javilopen planning to do in which he will cover the main categories of image generation that any professional might need. Follow Javi at…
Continue reading

October 11, 2024
LlamaParse Premium

LlamaParse premium is the best document parser out there for your context-augmented LLM application. It can handle complex slide decks, diagrams, multi-table Excel sheets, interleaving scanned document text, and any other document type with lots of text, tables, and visual elements. Dev Team spent a lot of effort to reduce the cost! Excited to announce…
Continue reading

October 8, 2024
Nvidia Nemotron 51B

Nvidia has just announced the release of Nemotron 51B, and it’s set to redefine AI model performance across the board. This new model is 220% faster and capable of handling 400% more workload than its predecessor, Llama 3.1 70B, making it a significant leap in terms of both speed and scalability. Better yet, it’s permissively…
Continue reading

September 24, 2024
AI NEWS [Mon Sep 16, 2024]

AI NEWS: The “Godmother of AI” just launched World Labs, teaching AI to create and understand 3D worlds. Plus, more developments from OpenAI, Tencent, Runway, Google, and Meta. Here’s everything that happened in AI over the weekend. Fei-Fei Li announced a new startup Fei-Fei Li announced a new startup, World Labs, to develop AI models…
Continue reading

September 16, 2024
Pixtral 12B (pixtral-12b-240910)

Mistral released Pixtral 12B Vision Language Model (pixtral-12b-240910). Some notes on the release below. Installation Mistral common has image support! You can now pass images and URLs alongside text into the user message. To use the model checkpoint: Images You can encode images as follows: Image URLs You can pass image url which will be…
Continue reading

September 11, 2024
Workspaces – Anthropic API Console

Anthropic has introduced Workspaces in the Anthropic API Console to assist developers in efficiently managing multiple Claude deployments. These Workspaces serve as unique environments that allow for the organization of resources, the streamlining of access controls, and the setting of custom spend and rate limits at a more granular level. For developers deploying Claude across…
Continue reading

September 10, 2024
LLaVA V1.5 7B on Groq

LLaVA (Large Language and Vision Assistant) is an open-source multimodal chatbot designed to follow instructions across different modes of communication. It is developed by fine-tuning the LLaMA/Vicuna language models on data generated by GPT, enabling it to understand and generate both text and visual information. Built on the transformer architecture, LLaVA operates as an auto-regressive…
Continue reading

September 4, 2024
Vidu AI

Vidu AI is a powerful new tool for generating videos from text and images. As it enters the market, it positions itself as a competitor to other video generation tools like Sora AI, Runway ML, and Pikalabs. This guide will walk you through how to use Vidu AI’s text-to-video and image-to-video features, showing you how…
Continue reading

August 27, 2024
PixVerse V2.5

Introducing PixVerse V2.5! Ready to create flawless AI videos without lag or distortion? Explore latest upgrades and unleash your creativity. With enhanced prompt understanding, your creative visions are now sharper and more precise with PixVerse V2.5. Activate PixVerse’s Performance Mode to achieve stunning motion effects and vivid details. Plus, our Magic Brush is now available…
Continue reading

August 27, 2024
Grok-2

Rowan Cheung, founder of @therundownai, got early (beta) access to Grok 2. Spent all night testing it. Here are 6 things it can do that ChatGPT cannot. First, what is Grok 2? Grok 2 is xAI‘s newest AI model that’s slowly rolling out on X for premium users Its edge over other LLMs is that…
Continue reading

August 21, 2024