Etched Inc., a developer of artificial intelligence inference chips, launched today with $800 million in funding. The startup ...
Demand for AI inference compute workloads is increasing rapidly, and Nvidia is dominating the market despite competition from ...
ON Semiconductor's fast-growing revenue related to data centers is likely to become a key growth driver for many years to ...
Forbes contributors publish independent expert analyses and insights. I write about the economics of AI. When OpenAI’s ChatGPT first exploded onto the scene in late 2022, it sparked a global obsession ...
Seven months after inking a $20 billion chip licensing deal with Nvidia Corp., Groq Inc. today announced that it has raised ...
A food fight erupted at the AI HW Summit earlier this year, where three companies all claimed to offer the fastest AI processing. All were faster than GPUs. Now Cerebras has claimed insanely fast AI ...
Startup Baseten is reportedly close to finalizing a $1.5 billion round at a $13 billion as the “inference gold rush" marches ...
A significant shift is under way in artificial intelligence, and it has huge implications for technology companies big and small. For the past half-decade, most of the focus in AI has been on training ...
AI inference is undergoing the same transformation that cloud infrastructure experienced a decade ago. Open-weight models have expanded who runs AI — neoclouds, regulated enterprises, and AI-native ...
Perplexity AI unveiled a hybrid local-cloud inference system at Computex 2026 that automatically routes AI tasks between a user’s device and the cloud, signaling a major shift in enterprise AI, ...
AMD is strategically positioned to dominate the rapidly growing AI inference market, which could be 10x larger than training by 2030. The MI300X's memory advantage and ROCm's ecosystem progress make ...