-
Food Photography with Generative AI
How create hyper realistic food photography with Generative AI in 4k resolution ready to print (4,736 x 3,520 px). Step-by-step tutorial below. his is tutorial 1/20 of the Freepik’s Mystic exploration series @javilopen planning to do in which he will cover the main categories of image generation that any professional might need. Follow Javi at…
-
LlamaParse Premium
LlamaParse premium is the best document parser out there for your context-augmented LLM application. It can handle complex slide decks, diagrams, multi-table Excel sheets, interleaving scanned document text, and any other document type with lots of text, tables, and visual elements. Dev Team spent a lot of effort to reduce the cost! Excited to announce…
-
Nvidia Nemotron 51B
Nvidia has just announced the release of Nemotron 51B, and it’s set to redefine AI model performance across the board. This new model is 220% faster and capable of handling 400% more workload than its predecessor, Llama 3.1 70B, making it a significant leap in terms of both speed and scalability. Better yet, it’s permissively…
-
AI NEWS [Mon Sep 16, 2024]
AI NEWS: The “Godmother of AI” just launched World Labs, teaching AI to create and understand 3D worlds. Plus, more developments from OpenAI, Tencent, Runway, Google, and Meta. Here’s everything that happened in AI over the weekend. Fei-Fei Li announced a new startup Fei-Fei Li announced a new startup, World Labs, to develop AI models…
-
Pixtral 12B (pixtral-12b-240910)
Mistral released Pixtral 12B Vision Language Model (pixtral-12b-240910). Some notes on the release below. Installation Mistral common has image support! You can now pass images and URLs alongside text into the user message. To use the model checkpoint: Images You can encode images as follows: Image URLs You can pass image url which will be…
-
Workspaces – Anthropic API Console
Anthropic has introduced Workspaces in the Anthropic API Console to assist developers in efficiently managing multiple Claude deployments. These Workspaces serve as unique environments that allow for the organization of resources, the streamlining of access controls, and the setting of custom spend and rate limits at a more granular level. For developers deploying Claude across…
-
LLaVA V1.5 7B on Groq
LLaVA (Large Language and Vision Assistant) is an open-source multimodal chatbot designed to follow instructions across different modes of communication. It is developed by fine-tuning the LLaMA/Vicuna language models on data generated by GPT, enabling it to understand and generate both text and visual information. Built on the transformer architecture, LLaVA operates as an auto-regressive…
-
Vidu AI
Vidu AI is a powerful new tool for generating videos from text and images. As it enters the market, it positions itself as a competitor to other video generation tools like Sora AI, Runway ML, and Pikalabs. This guide will walk you through how to use Vidu AI’s text-to-video and image-to-video features, showing you how…
-
PixVerse V2.5
Introducing PixVerse V2.5! Ready to create flawless AI videos without lag or distortion? Explore latest upgrades and unleash your creativity. With enhanced prompt understanding, your creative visions are now sharper and more precise with PixVerse V2.5. Activate PixVerse’s Performance Mode to achieve stunning motion effects and vivid details. Plus, our Magic Brush is now available…
-
Grok-2
Rowan Cheung, founder of @therundownai, got early (beta) access to Grok 2. Spent all night testing it. Here are 6 things it can do that ChatGPT cannot. First, what is Grok 2? Grok 2 is xAI‘s newest AI model that’s slowly rolling out on X for premium users Its edge over other LLMs is that…