Follow Datanami:

Tag: GPU

Kinetica Elevates RAG with Fast Access to Real-Time Data

Kinetica got its start building a GPU-powered database to serve fast SQL queries and visualizations for US government and military clients. But with a pair of announcements at Nvidia’s GTC show last week, the company i Read more…

Faster Interconnects and Switches to Help Relieve Data Bottlenecks

Nvidia’s new Blackwell architecture may have stolen the show this week at the GPU Technology Conference in San Jose, California. But an emerging bottleneck at the network layer threatens to make bigger and brawnier pro Read more…

Nvidia Introduces New Blackwell GPU for Trillion-Parameter AI Models

Nvidia’s latest and fastest GPU, code-named Blackwell, is here and will underpin the company’s AI plans this year. The chip offers performance improvements from its predecessors, including the red-hot H100 and A100 Read more…

Nvidia Looks to Accelerate GenAI Adoption with NIM

Today at the GPU Technology Conference, Nvidia launched a new offering aimed at helping customers quickly deploy their generative AI applications in a secure, stable, and scalable manner. Dubbed Nvidia Inference Microser Read more…

How HP Was Able to Leapfrog Other PC/Workstation OEMs to Launch its AI Solution

This month, HP stunned the market with the most powerful line of AI-enabled PCs and workstations yet to be created, catching the other OEMs, which have been more tentative and less complete, by surprise. I was at the lau Read more…

Predibase Launches LoRA Land to Rival GPT-4

Predibase, the leading developer platform for fine-tuning LLMs, has introduced LoRA Land, a collection of 25 open-source fine-tuned models that the company claims can challenge or even outperform the hugely popular GPT- Read more…

The Future of AI Is Hybrid

Artificial intelligence today is largely something that occurs in the cloud, where huge AI models are trained and deployed on massive racks of GPUs. But as AI makes its inevitable migration into to the applications and d Read more…

Voltron Aims to Unblock AI with GPU-Accelerated Data Processing

Data is at the heart of artificial intelligence, but it’s also emerging as one of its biggest bottlenecks. Without sufficient quantities of good, clean data to feed into models, companies simply can’t reap the reward Read more…

Top 10 Challenges to GenAI Success

So you want to implement generative AI? That’s great news! You can count yourself among the majority of IT decision makers who have also seen the potential of this transformative tech. While GenAI has the potential to Read more…

2024 GenAI Predictions: Part One

Something unexpected happened in 2023: the world learned about generative AI for the first time. Now that the GenAI cat is out of the bag, companies are in a race to monetize it without succumbing to its negative aspects Read more…

How AWS Plans to Cope with GenAI’s Insatiable Desire for Compute

Companies that toyed around with generative AI this year will be looking to play for keeps in 2024 with production GenAI apps that move the business needle. Considering there’s not enough GPUs to go around now, where w Read more…

Five AWS Predictions as re:Invent 2023 Kicks Off

The computing world this week turns to Amazon Web Services, the company that originally defined the public cloud space and continues to enjoy the largest slice of the cloud pie. Its annual re:Invent conference, which kic Read more…

Pandas on GPU Runs 150x Faster, Nvidia Says

Data scientists and others who work in pandas may be interested to hear about a new release of Nvidia’s RAPIDS cuDF framework that it says results in a 150x performance boost for pandas running atop a GPU. Pandas is Read more…

Anyscale and Nvidia In LLM Hookup

GenAI developers building atop large language models (LLMs) are the big winners of a new partnership between Anyscale and Nvidia unveiled this week that will see the GPU maker’s AI software integrated into Anyscale’s Read more…

How to Manage ML Workflows Like a Netflix Data Scientist

Data scientists want to do data science. It’s right there in the title, after all. But data scientists often are asked to do other things besides building machine learning models, such as creating data pipelines and pr Read more…

ChatGPT Gives Kinetica a Natural Language Interface for Speedy Analytics Database

It would normally take quite a bit of complex SQL to tease a multi-pronged answer out of Kinetica’s high-speed analytics database, which is powered by GPUs but wire-compataible with Postgres. But with the new natural l Read more…

ChatGPT Puts AI At Inflection Point, Nvidia CEO Huang Says

It’s been 11 years since three AI researchers shocked the world with a breakthrough in computer vision, kickstarting the deep learning craze. But with emergence of generative language models like ChatGPT over the past Read more…

Nvidia Unveils GPUs for Generative Inference Workloads like ChatGPT

Today at its GPU Technology Conference, Nvidia took the wraps off three new GPUs designed to accelerate inference workloads for generative AI applications, including generating text, images, and videos. It also launched Read more…

New PyTorch 2.0 Compiler Promises Big Speedup for AI Developers

Machine learning and AI developers are eager to get their hands on PyTorch 2.0, which was unveiled in late 2022 and is due to become available this month. Among the features greeting eager ML developers is a compiler as Read more…

OpenXLA Delivers Flexibility for ML Apps

Machine learning developers gained new abilities to develop and run their ML programs on the framework and hardware of their choice thanks to the OpenXLA Project, which today announced the availability of key open source Read more…

Datanami