Technologies » Processors
Features
How Neuromorphic Processing and Self-Searching Storage Can Slash Cyber Risk for Federal Agencies
The amount of information organizations must process at the edge has exploded. This is especially true for federal agencies and the military, which generate enormous quantities of data from mobile devices and sensors in Read more…
Google Claims Its TPU v4 Outperforms Nvidia A100
A new scientific paper from Google details the performance of its Cloud TPU v4 supercomputing platform, claiming it provides exascale performance for machine learning with boosted efficiency. The authors of the resear Read more…
ChatGPT Puts AI At Inflection Point, Nvidia CEO Huang Says
It’s been 11 years since three AI researchers shocked the world with a breakthrough in computer vision, kickstarting the deep learning craze. But with emergence of generative language models like ChatGPT over the past Read more…
Nvidia Unveils GPUs for Generative Inference Workloads like ChatGPT
Today at its GPU Technology Conference, Nvidia took the wraps off three new GPUs designed to accelerate inference workloads for generative AI applications, including generating text, images, and videos. It also launched Read more…
NeuroBlade Seeks Controlled Growth for Big Data Bottleneck-Buster
Early adopters of NeuroBlade’s processing-in-memory (PIM) architecture, called XRAM, are showing a 10x to 60X boost in throughput for big SQL workloads. But the company is playing it safe on the growth front, so don’ Read more…
News In Brief
Samsung to Ship Next-Generation Smart SSD This Year
Samsung will ship a new version of its "smart" SSD for data centers that's more than just a storage device -- it can be a CPU and accelerator when needed. The second-generation SmartSSD computational storage device (C Read more…
Nvidia Launches Hopper H100 GPU, New DGXs and Grace Superchips
The battle for datacenter dominance keeps getting hotter. Today, Nvidia kicked off its spring GTC event with new silicon, new software and a new supercomputer. Speaking from a virtual environment in the Nvidia Omnivers Read more…
Run:ai Seeks to Grow AI Virtualization with $75M Round
Run:ai, a provider of an AI virtualization layer that helps optimize GPU instances, yesterday announced a Series C round worth $75 million. The funding figures to help the fast-growing company expand its sales reach and Read more…
OmniSci Gets HEAVY New Name and New CEO
OmniSci, developers of a GPU-based analytics database, today announced it has changed its name to HEAVY.AI. There’s also a change of leadership at the top of the San Francisco company, as Jon Kondo takes over for co-fo Read more…
This Just In
March 28, 2024 — MLCommons published results of the industry-standard MLPerf v4.0 benchmark for inference. Intel’s results for Intel Gaudi 2 accelerators and 5th Gen Intel Xeon Scalable processors with Intel Advanced Matrix Extensions (Intel AMX) reinforce the company’s commitment to bring “AI Everywhere” Read more…
REDWOOD CITY, Calif., March 21, 2024 — Zilliz is proud to announce the release of Milvus 2.4, setting a new standard in vector search capabilities with a groundbreaking GPU indexing feature powered by NVIDIA’s CUDA-Accelerated Graph Index for Vector Retrieval (CAGRA), part of the RAPIDS cuVS library. Read more…
SAN JOSE, Calif., March 19, 2024 – DDN today demonstrated its reference architecture, which integrates NVIDIA BlueField-3 DPUs into DDN EXAScaler and Infinia storage appliances. Benefits extend to the full end-to-end AI data center stack, synergizing with networking platforms such as NVIDIA Spectrum-X Ethernet for accelerating multi-tenant AI clouds. Read more…
SAN JOSE, Calif., March 19, 2024 – Supermicro, Inc. is announcing its latest portfolio to accelerate the deployment of generative AI. The Supermicro SuperCluster solutions provide foundational building blocks for the present and the future of large language model (LLM) infrastructure. Read more…
NEW YORK, March 18, 2024 — SQream, a leading provider of scalable, GPU-accelerated data analytics software for large data sets and AI/ML workloads, has announced its participation in this year’s NVIDIA GTC AI conference. Read more…