-
Llama 3.1 405B
Starting today, open source is leading the way. Meta introducing Llama 3.1 405b their most capable models yet. Today Meta releasing a collection of new Llama 3.1 models including our long awaited 405B. These models deliver improved reasoning capabilities, a larger 128K token context window and improved support for 8 languages among other improvements. Llama…
-
Nvidia Minitron 4B / 8B
Nvidia releases Minitron 4B & 8B – iteratively pruning and distilling 2-4x smaller models from large LLMs, requiring 40x fewer training tokens and with 16% improvement on MMLU! Distilled model (w/ pruning + retraining) beats teacher! Best practices: And many more in the paper below. Minitron 8B Base Minitron is a family of small language…
-
ElevenLabs Reader App (iOS / Android)
The ElevenLabs Reader App is now on iOS and Android. Choose from hundreds of high quality AI voices to narrate any article, PDF, or ePub on the go. It’s only available in the US, UK, and Canada today. The app will launch worldwide in the next few weeks once ElevenLabs Dev Team add support for…
-
RTVI-AI: Real-time Voice and Video Inference
Today Daily announcing an open standard for Real-time Voice and Video Inference: RTVI-AI. The RTVI abstractions and data structures define how client applications communicate with inference services. These are the “real-time APIs” for use cases like: Daily Team shipping open source reference JavaScript and React SDKs today, with iOS, Android and other platform SDKS coming…
-
Open Source AI News [Mon 22 July]
Probably the craziest week in Open Source AI (yet): There’s a lot more; Arcee (mergekit) released a series of LLMs, each better than the other, and Numina and HF Numina 72B (based on Qwen 2) and Math datasets, Mixbread with embedding models (english + german) and a lot more! It’s fun to see so many…
-
StockBot
@GroqInc is incredibly fast, currently up to 1200+ tokens/second. But what can you do with that speed? Introducing StockBot, a lightning fast AI chatbot powered by Llama3-70b on Groq that responds with live stock charts, financials, news, and screeners. All open source! StockBot is an app that leverages GroqInc’s speed, @vercel’s AI SDK, and @tradingview’s…
-
Apple DCLM-7B Model
Apple has officially entered the language model landscape with the release of a DCML-7B open-source language model, including weights, training code, and dataset. Key Highlights Model Params Tokens Open dataset? CORE MMLU EXTENDED Open weights, closed datasets Llama2 7B 2T ✗ 49.2 45.8 34.1 DeepSeek 7B 2T ✗ 50.7 48.5 35.3 Mistral-0.3 7B ? ✗…
-
Mistral 12B NeMo
Mistral 12B NeMo is a state-of-the-art 12B model with 128k context length, built in collaboration with NVIDIA, and released under the Apache 2.0 license. Today marks the release of Mistral NeMo, a new AI model developed by the Mistral team in collaboration with NVIDIA. Mistral NeMo, a 12B model, features a large context window of…
-
GPT-4o mini
OpenAI is dedicated to making intelligence broadly accessible. Today, they are announcing GPT-4o mini, their most cost-efficient small model. OpenAI anticipates that GPT-4o mini will significantly expand the range of AI applications by making intelligence much more affordable. GPT-4o mini scores 82% on MMLU and currently outperforms GPT-4 in chat preferences on the LMSYS leaderboard.…
-
The “State of Robotics” Report
Coatue just released its “State of Robotics” report covering AI & humanoids. Thanks @KVibhor for sharing it. Six insightful slides you should see below. 1. There’s not enough quality training data to build a general purpose AI model for robots. 2. A market map of the ecosystem 3. The cost of building humanoids is expected…