Language Modeling - Search News

Open source Mamba 3 arrives to surpass Transformer architecture with nearly 4% improved language modeling, reduced latency

This release is good for developers building long-context applications, real-time reasoning agents, or those seeking to ...

12h

OpenAI, Mistral AI release new hardware-efficient language models

OpenAI Group PBC and Mistral AI SAS today introduced new artificial intelligence models optimized for cost-sensitive use ...

Forbes

Mistral AI And Nvidia Unveil New Language Model: Mistral NeMo 12B

Forbes contributors publish independent expert analyses and insights. Chief Analyst & CEO, NAND Research. Mistral AI and NVIDIA launched Mistral NeMo 12B, a state-of-the-art language model for ...

EPC Group Expands Power BI Copilot With Enterprise Multi-Model AI Architecture

New architecture integrates Copilot, Azure OpenAI, Claude, and Perplexity to transform Microsoft Power BI into an ...

Communications of the ACM

Measuring What Matters in Large Language Model Performance

As large language models (LLMs) gain momentum worldwide, there’s a growing need for reliable ways to measure their performance. Benchmarks that evaluate LLM outputs allow developers to track ...

World's first Tibetan large language model unveiled in Lhasa

The world's first Tibetan large language model and its application, DeepZang, has been officially unveiled in Lhasa, ...

MIT Technology Review

Anthropic can now track the bizarre inner workings of a large language model

What the firm found challenges some basic assumptions about how this technology really works. The AI firm Anthropic has developed a way to peer inside a large language model and watch what it does as ...

Wired

Small Language Models Are the New Rage, Researchers Say

The original version of this story appeared in Quanta Magazine. Large language models work well because they’re so large. The latest models from OpenAI, Meta, and DeepSeek use hundreds of billions of ...

Semiconductor Engineering

Small Language Models: A Solution To Language Model Deployment At The Edge?

While Large Language Models (LLMs) like GPT-3 and GPT-4 have quickly become synonymous with AI, LLM mass deployments in both training and inference applications have, to date, been predominately cloud ...

Earth.com

How AI learned a complex coding language nobody taught it

Researchers show AI can learn a rare programming language by correcting its own errors, improving its coding success from 39% to 96%.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results