Inference Test - Search News

MLCommons releases results of its latest MLPerf AI inference benchmark test

MLCommons today released the latest results of its MLPerf Inference benchmark test, which compares the speed of artificial intelligence systems from different hardware makers. MLCommons is an industry ...

GitHub

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices.

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome). Currently, there are two reference engine ...

GitHub

A learning-focused LLM inference implementation, from CPU baselines to optimized CUDA kernels.

This repository is an LLM inference implementation in C++/CUDA, designed as a learning-focused codebase rather than a production engine. We begin with raw, unoptimized C++ and iteratively apply ...

Gizmochina

NVIDIA GB300 GPUs deliver huge AI efficiency gains in Deepseek R1 inference test

NVIDIA’s latest Blackwell-based GB300 GPUs are starting to show what they can do, and early results point to a massive jump in efficiency compared to the company’s previous generation. A recent ...

VentureBeat

Deci’s NLP model clocks 100,000 queries per second in latest MLPerf results

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Deci, a deep-learning software maker that uses AI-powered tools to help ...

Forbes

The Ladder Of Inference: A Pathway To Better Collaboration

Many theories and tools abound to aid leaders in decision-making. This is because we often find ourselves caught between two perceived poles: following gut instincts or adopting a data-driven approach ...

VentureBeat

How test-time scaling unlocks hidden reasoning abilities in small language models (and allows them to outperform LLMs)

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Very small language models (SLMs) can ...

The New York Times

Wasps Passed This Logic Test. Can You?

The insects frequently found in your backyard appear to be the first invertebrate known to be capable of the skill of transitive inference. By Cara Giaimo Here’s a pop quiz for you. Tom is taller than ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results