Follow Datanami:

Technologies » Processors

Features

How AWS Plans to Cope with GenAI’s Insatiable Desire for Compute

Companies that toyed around with generative AI this year will be looking to play for keeps in 2024 with production GenAI apps that move the business needle. Considering there’s not enough GPUs to go around now, where w Read more…

AWS Teases 65 Exaflop ‘Ultra-Cluster’ with Nvidia, Launches New Chips

AWS yesterday unveiled new EC2 instances geared toward tackling some of the fastest growing workloads, including AI training and big data analytics. During his re:Invent keynote, CEO Adam Selipsky also welcomed Nvidia fo Read more…

Microsoft Rolls Out Host of New AI Features at Ignite 2023

From new copilots and AI development tools, to vector search and AI chips, artificial intelligence featured prominently in Microsoft’s annual Ignite developers conference held this week. It also unveiled some data news Read more…

Alluxio Touts 4X Greater GPU Utilization for AI Training

Customers that use the high-speed cache in the new Alluxio Enterprise AI platform can squeeze up to four times as much work out of their GPU setups than without it, Alluxio announced today. Alluxio also says the overall Read more…

New Language Seeks Reunion of AI Devs, Mojo

A software development kit for Mojo, a new Python-based language for AI development created by former Google engineers, is now available for download on Linux systems, with support for Mac and Windows coming soon, the co Read more…

News In Brief

AWS Introduces a Flurry of New EC2 Instances at re:Invent

AWS has announced three new Amazon Elastic Compute Cloud (Amazon EC2) instances powered by AWS-designed chips, as well as several new Intel-powered instances at its AWS re:Invent 2022 event in Las Vegas. The first new Read more…

IBM Collaboration Looks to Bring Massive AI Models to Any Cloud

Training machine learning foundation models with sometimes billions of parameters demands serious computing power. For example, the largest version of GPT-3, the famous large language model behind OpenAI’s DALL-E 2, ha Read more…

Quine Streaming Graph Processes Over One Million Events Per Second

Quine, a real-time graph data processing engine for ETL pipelines, has reached a performance benchmark of processing one million events per second, according to its maker, thatDot. Named after logician Willard Van Orm Read more…

AMD Previews 400 Gig Adaptive SmartNIC SOC at Hot Chips

Fresh from finalizing its acquisitions of FPGA provider Xilinx (Feb. 2022) and DPU provider Pensando (May 2022), AMD previewed what it calls a 400 Gig Adaptive smartNIC SOC yesterday at Hot Chips. It is another cont Read more…

Samsung Announces 24Gbps GDDR6 DRAM for 30% Faster Speeds

Samsung Electronics announced Thursday it has begun sampling a 16GB Graphics Double Data Rate 6 (GDDR6) DRAM featuring 24-gigabit-per-second processing speeds. The new high-speed memory chips for graphics cards are bu Read more…

This Just In

SQream Launches In-Database Model Training to Boost AI and ML Performance

Apr 10, 2024 |

NEW YORK, April 10, 2024 — SQream announced today that it has launched an ‘in-database model training’ feature, to enable use of their solution by customers as both an integrated analytics platform as well as a machine learning model trainer. Read more…

Intel Launches Gaudi 3 Accelerator, Advancing Enterprise AI with Performance and Openness

Apr 9, 2024 |

PHOENIX, April 9, 2024 — At the Intel Vision 2024 customer and partner conference, Intel introduced the Intel Gaudi 3 accelerator to bring performance, openness and choice to enterprise generative AI (GenAI), and unveiled a suite of new open scalable systems, next-gen products and strategic collaborations to accelerate GenAI adoption. Read more…

Lambda Announces $500M GPU-Backed Facility to Expand Cloud for AI

Apr 4, 2024 |

SAN JOSE, Calif., April 4, 2024 — Lambda, the GPU cloud company founded by AI engineers and powered by NVIDIA GPUs, today announced that it has secured a special purpose GPU financing vehicle of up to $500 million to fund the expansion of its on-demand cloud offering. Read more…

Hailo Secures Additional $120M, Unveils New Hailo-10 GenAI Accelerator

Apr 3, 2024 |

TEL AVIV, Israel, April 3, 2024 — Hailo, a pioneering chipmaker of edge artificial intelligence (AI) processors, announced it has extended its series C fundraising round with an additional investment of $120 million. Read more…

Cloudflare Launches Workers AI, Integrates Hugging Face for Effortless, Global AI Model Deployment

Apr 2, 2024 |

SAN FRANCISCO, April 2, 2024 — Cloudflare, Inc. today announced that developers can now deploy AI applications on Cloudflare’s global network in one simple click directly from Hugging Face, the leading open and collaborative platform for AI builders. Read more…

Datanami