Features

How Neuromorphic Processing and Self-Searching Storage Can Slash Cyber Risk for Federal Agencies

The amount of information organizations must process at the edge has exploded. This is especially true for federal agencies and the military, which generate enormous quantities of data from mobile devices and sensors in Read more…

Google Claims Its TPU v4 Outperforms Nvidia A100

A new scientific paper from Google details the performance of its Cloud TPU v4 supercomputing platform, claiming it provides exascale performance for machine learning with boosted efficiency. The authors of the resear Read more…

ChatGPT Puts AI At Inflection Point, Nvidia CEO Huang Says

It’s been 11 years since three AI researchers shocked the world with a breakthrough in computer vision, kickstarting the deep learning craze. But with the emergence of generative language models like ChatGPT over the past Read more…

Nvidia Unveils GPUs for Generative Inference Workloads like ChatGPT

Today at its GPU Technology Conference, Nvidia took the wraps off three new GPUs designed to accelerate inference workloads for generative AI applications, including generating text, images, and videos. It also launched Read more…

NeuroBlade Seeks Controlled Growth for Big Data Bottleneck-Buster

Early adopters of NeuroBlade’s processing-in-memory (PIM) architecture, called XRAM, are showing a 10x to 60x boost in throughput for big SQL workloads. But the company is playing it safe on the growth front, so don’ Read more…

News In Brief

Samsung to Ship Next-Generation Smart SSD This Year

Samsung will ship a new version of its "smart" SSD for data centers that's more than just a storage device -- it can be a CPU and accelerator when needed. The second-generation SmartSSD computational storage device (C Read more…

Nvidia Launches Hopper H100 GPU, New DGXs and Grace Superchips

The battle for datacenter dominance keeps getting hotter. Today, Nvidia kicked off its spring GTC event with new silicon, new software and a new supercomputer. Speaking from a virtual environment in the Nvidia Omnivers Read more…

Nvidia Bolsters Edge AI and Autonomous Robots at GTC 2022

Amid the flood of news coming out of Nvidia’s GPU Technology Conference (GTC) today were a pair of announcements aimed at accelerating the development of AI on the edge and enabling autonomous mobile robots, or AMRs. Read more…

Run:ai Seeks to Grow AI Virtualization with $75M Round

Run:ai, a provider of an AI virtualization layer that helps optimize GPU instances, yesterday announced a Series C round worth $75 million. The funding figures to help the fast-growing company expand its sales reach and Read more…

OmniSci Gets HEAVY New Name and New CEO

OmniSci, developers of a GPU-based analytics database, today announced it has changed its name to HEAVY.AI. There’s also a change of leadership at the top of the San Francisco company, as Jon Kondo takes over for co-fo Read more…

This Just In

Intel Gaudi 2 Remains Only Benchmarked Alternative to NV H100 for GenAI Performance

Mar 28, 2024

March 28, 2024 — MLCommons published results of the industry-standard MLPerf v4.0 benchmark for inference. Intel’s results for Intel Gaudi 2 accelerators and 5th Gen Intel Xeon Scalable processors with Intel Advanced Matrix Extensions (Intel AMX) reinforce the company’s commitment to bring “AI Everywhere” Read more…

Zilliz Introduces Milvus 2.4 with GPU Indexing Support for CAGRA

Mar 21, 2024

REDWOOD CITY, Calif., March 21, 2024 — Zilliz is proud to announce the release of Milvus 2.4, setting a new standard in vector search capabilities with a groundbreaking GPU indexing feature powered by NVIDIA’s CUDA-Accelerated Graph Index for Vector Retrieval (CAGRA), part of the RAPIDS cuVS library. Read more…

DDN Integrates NVIDIA BlueField-3 DPUs for Enhanced AI Data Center Efficiency and Performance

Mar 19, 2024

SAN JOSE, Calif., March 19, 2024 – DDN today demonstrated its reference architecture, which integrates NVIDIA BlueField-3 DPUs into DDN EXAScaler and Infinia storage appliances. Benefits extend to the full end-to-end AI data center stack, synergizing with networking platforms such as NVIDIA Spectrum-X Ethernet for accelerating multi-tenant AI clouds. Read more…

Supermicro Launches 3 NVIDIA-Based, Full-Stack GenAI SuperClusters That Scale from Enterprise to Large LLM Infrastructures

Mar 19, 2024

SAN JOSE, Calif., March 19, 2024 – Supermicro, Inc. is announcing its latest portfolio to accelerate the deployment of generative AI. The Supermicro SuperCluster solutions provide foundational building blocks for the present and the future of large language model (LLM) infrastructure. Read more…

SQream Demonstrates Next-Level AI and ML Data Processing Capabilities at NVIDIA GTC

Mar 18, 2024

NEW YORK, March 18, 2024 — SQream, a leading provider of scalable, GPU-accelerated data analytics software for large data sets and AI/ML workloads, has announced its participation in this year’s NVIDIA GTC AI conference. Read more…
