-
Apple DCLM-7B Model
Apple has officially entered the language model landscape with the release of DCLM-7B, an open-source language model, including weights, training code, and dataset. Key Highlights Model Params Tokens Open dataset? CORE MMLU EXTENDED Open weights, closed datasets Llama2 7B 2T ✗ 49.2 45.8 34.1 DeepSeek 7B 2T ✗ 50.7 48.5 35.3 Mistral-0.3 7B ? ✗…
-
Mistral 12B NeMo
Mistral 12B NeMo is a state-of-the-art 12B model with 128k context length, built in collaboration with NVIDIA, and released under the Apache 2.0 license. Today marks the release of Mistral NeMo, a new AI model developed by the Mistral team in collaboration with NVIDIA. Mistral NeMo, a 12B model, features a large context window of…
-
GPT-4o mini
OpenAI is dedicated to making intelligence broadly accessible. Today, they are announcing GPT-4o mini, their most cost-efficient small model. OpenAI anticipates that GPT-4o mini will significantly expand the range of AI applications by making intelligence much more affordable. GPT-4o mini scores 82% on MMLU and currently outperforms GPT-4 in chat preferences on the LMSYS leaderboard.…
-
The “State of Robotics” Report
Coatue just released its “State of Robotics” report covering AI & humanoids. Thanks @KVibhor for sharing it. Below are six insightful slides you should see. 1. There’s not enough quality training data to build a general-purpose AI model for robots. 2. A market map of the ecosystem. 3. The cost of building humanoids is expected…
-
Eureka Labs
Eureka Labs, an AI and education company founded by Andrej Karpathy, is creating a new kind of school that is inherently AI-driven. How can an ideal learning experience be achieved? Imagine learning physics with Richard Feynman guiding you every step of the way. Unfortunately, such passionate and skilled experts, who are infinitely patient and fluent…
-
Claude Android App
In today’s fast-paced world, having a powerful tool at your fingertips can make all the difference. That’s why we’re thrilled to announce the launch of the Claude Android app, bringing Anthropic’s advanced AI assistant directly to your mobile device. Claude is more than just another chatbot. Powered by the cutting-edge Claude 3 model family, this…
-
Codestral Mamba 7B
Mistral releases their first Mamba model! Codestral Mamba 7B is a code LLM based on the Mamba2 architecture. It is released under Apache 2.0 and achieves 75% on HumanEval for Python coding. They also released a math fine-tune based on Mistral 7B that achieves 56.6% on MATH and 63.47% on MMLU. Mamba Model Details Following the publication…
-
What is Groq LPU?
In the realm of artificial intelligence, speed and efficiency are paramount. Enter Groq’s Language Processing Unit (LPU) – a revolutionary AI inference technology designed to deliver unparalleled compute speed, affordability, and energy efficiency. Groq’s LPU stands as a new category of processor, built from the ground up to meet the specific needs of AI applications,…
-
Lleverage AI: How does it work?
Lleverage AI has successfully raised $2 million in pre-seed funding for its innovative low-code AI development platform. This platform is designed to empower teams to build, test, and deploy AI features into applications seamlessly, without requiring deep AI expertise. Here’s an in-depth look at how Lleverage AI works and the benefits it offers. Create the…
-
Suno
As a music artist, I find myself torn between excitement and apprehension when exploring the capabilities of Suno AI. This advanced music generation tool harnesses artificial intelligence to create music based on user prompts. While the technology is impressive, it raises questions about the future of human creativity in music. Let’s dive into Suno AI,…