Intel's former CEO puts money into a little-known hardware startup that wants to make Nvidia obsolete
Date:
Tue, 08 Apr 2025 17:32:00 +0000
Description:
UK-based startup Fractile wants to make Nvidia obsolete and is backed by NATO and Intel's former CEO.
FULL STORY ======================================================================UK-based
Fractile is backed by NATO and wants to build faster and cheaper in-memory
AI compute Nvidia's bruteforce GPU approach consumes too much power and is held back by memory Fractile's numbers focused on a cluster of H100 GPU comparison, not the mainstream H200
Nvidia sits comfortably at the top of the AI hardware food chain, dominating the market with its high-performance GPUs and CUDA software stack, which have quickly become the default tools for training and running large AI models - but that dominance comes at a cost - namely, a growing target on its back.
Hyperscalers like Amazon, Google, Microsoft and Meta are pouring resources into developing their own custom silicon in an effort to reduce their dependence on Nvidias chips and cut costs. At the same time, a wave of AI hardware startups is trying to capitalize on rising demand for specialized accelerators, hoping to offer more efficient or affordable alternatives and, ultimately, to displace Nvidia.
You may not have heard of UK-based Fractile yet, but the startup, which
claims its revolutionary approach to computing can run the worlds largest language models 100x faster and at 1/10th the cost of existing systems, has some pretty noteworthy backers, including NATO and the former CEO of Intel, Pat Gelsinger. Removing every bottleneck
We are building the hardware that will remove every bottleneck to the fastest possible inference of the largest transformer networks," Fractile says.
"This means the biggest LLMs in the world running faster than you can read, and a universe of completely new capabilities and possibilities for how we work that will be unlocked by near-instant inference of models with
superhuman intelligence.
Its worth pointing out, before you get too excited, that Fractiles
performance numbers are based on comparisons with clusters of Nvidia H100
GPUs using 8-bit quantization and TensorRT-LLM, running Llama 2 70B - not the newer H200 chips.
In a LinkedIn posting, Gelsinger, who recently joined VC firm Playground Global as a General Partner, wrote, Inference of frontier AI models is bottlenecked by hardware. Even before test-time compute scaling, cost and latency were huge challenges for large scale LLM deployments... To achieve
our aspirations for AI, we will need radically faster, cheaper and much lower power inference.
Im pleased to share that Ive recently invested in Fractile, a UK-founded AI hardware company who are pursuing a path thats radical enough to offer such a leap," he then revealed.
"Their in-memory compute approach to inference acceleration jointly tackles the two bottlenecks to scaling inference, overcoming both the memory bottleneck that holds back todays GPUs, while decimating power consumption, the single biggest physical constraint we face over the next decade in
scaling up data center capacity. In fact, some of the ideas I was exploring
in my graduate work at Stanford University will now come to mainstream AI computing! You might also like Bye Nvidia! Meta tests its first in-house training AI-PU TSMC, Broadcom could tear apart Intel's legendary business after 57 years Nvidia is dreaming of trillion-dollar data centres with millions of GPUs
======================================================================
Link to news story:
https://www.techradar.com/pro/intels-former-ceo-puts-money-into-a-little-known -hardware-startup-that-wants-to-make-nvidia-obsolete
--- Mystic BBS v1.12 A47 (Linux/64)
* Origin: tqwNet Technology News (1337:1/100)