-
2025: The Year of Truly Personal AI Agents
The year 2025 holds incredible promise for personal AI agents that are more intuitive, capable, and deeply integrated into our lives. Imagine an agent that doesn’t just respond to queries but understands you holistically—your voice, your habits, and your unique digital ecosystem. These next-generation agents could revolutionize how we interact with technology, finally delivering on…
-
Pixtral-Large Model
Mistral AI just released its most powerful model yet. The benchmarks are pretty wild. Their new 124B parameter Pixtral-Large multimodal model. Mistral’s Le Chat platform also got revamped, complete with web search, document analysis, and image generation (powered by Black Forest Labs’ Flux Pro). Six months ago, the gap between open and closed models was…
-
FrontierMath
Today EpochAI launching FrontierMath, a benchmark for evaluating advanced mathematical reasoning in AI. They collaborated with 60+ leading mathematicians to create hundreds of original, exceptionally challenging math problems, of which current AI systems solve less than 2%. Existing math benchmarks like GSM8K and MATH are approaching saturation, with AI models scoring over 90%—partly due to…
-
DeepMind released AlphaFold 3 inference codebase
Wohoo! DeepMind released AlphaFold 3 inference codebase, model weights and an on-demand server! AlphaFold can generate highly accurate biomolecular structure predictions containing proteins, DNA, RNA, ligands, ions, and also model chemical modifications for proteins and nucleic acids in one platform. The model weights require you to fill a Google Form and wait for 3-4 business…
-
Gen AI News [November 1st, 2024]
Another great week in open ML!Here’s a small recap by @mervenoyann. Model releases ⏯️ Video Language Models @AIatMeta released LongVU, a new family of state-of-the-art long video LM models based on DINOv2, SigLIP, Qwen2 and Llama 3.2 💬 Small language models @huggingface released SmolLM2, a new family of smol language models with Apache 2.0 license…
-
Food Photography with Generative AI
How create hyper realistic food photography with Generative AI in 4k resolution ready to print (4,736 x 3,520 px). Step-by-step tutorial below. his is tutorial 1/20 of the Freepik’s Mystic exploration series @javilopen planning to do in which he will cover the main categories of image generation that any professional might need. Follow Javi at…
-
LlamaParse Premium
LlamaParse premium is the best document parser out there for your context-augmented LLM application. It can handle complex slide decks, diagrams, multi-table Excel sheets, interleaving scanned document text, and any other document type with lots of text, tables, and visual elements. Dev Team spent a lot of effort to reduce the cost! Excited to announce…
-
Nvidia Nemotron 51B
Nvidia has just announced the release of Nemotron 51B, and it’s set to redefine AI model performance across the board. This new model is 220% faster and capable of handling 400% more workload than its predecessor, Llama 3.1 70B, making it a significant leap in terms of both speed and scalability. Better yet, it’s permissively…
-
AI NEWS [Mon Sep 16, 2024]
AI NEWS: The “Godmother of AI” just launched World Labs, teaching AI to create and understand 3D worlds. Plus, more developments from OpenAI, Tencent, Runway, Google, and Meta. Here’s everything that happened in AI over the weekend. Fei-Fei Li announced a new startup Fei-Fei Li announced a new startup, World Labs, to develop AI models…
-
Pixtral 12B (pixtral-12b-240910)
Mistral released Pixtral 12B Vision Language Model (pixtral-12b-240910). Some notes on the release below. Installation Mistral common has image support! You can now pass images and URLs alongside text into the user message. To use the model checkpoint: Images You can encode images as follows: Image URLs You can pass image url which will be…