
Groq Adapts and Runs LLaMA, the Meta Chatbot Model and Competitor to ChatGPT, for Its Systems
MOUNTAIN VIEW, Calif., March 16, 2023 — Groq, a leading artificial intelligence (AI) and machine learning (ML) systems innovator, last week announced it adapted a new large language model (LLM), LLaMA–chatbot technology from Meta and a proposed alternative to ChatGPT–to run on its systems.
Facebook parent, Meta, released LLaMA, which can be used by chatbots to generate human-like text, on February 24th. Three days later the Groq team downloaded the model and within a few days had it running on a production GroqNode server, including eight GroqChip inference processors. This is a rapid time-to-functionality; a development task that can often take a larger team of engineers weeks to months to complete, while Groq executed with just a small group from its compiler team.
Jonathan Ross, CEO and founder of Groq said, “This speed of development at Groq validates that our generalizable compiler and software-defined hardware approach is keeping up with the accelerating pace of LLM innovation–something traditional kernel-based approaches struggle with.”
The rapid LLaMA bring-up by Groq is a particularly unique and noteworthy milestone because Meta researchers originally developed LLaMA for NVIDIA chips. With Groq engineers successfully running a cutting-edge model on its technology, they demonstrated GroqChip as a ready-to-use alternative to incumbent technology. Generative AI is carving out a place for itself in the market, and as transformers continue to advance the pace of LLM development, customers will need solutions that provide tangible time-to-production advantages, reducing developer complexity for fast iteration.
Bill Xing, Tech Lead Manager, ML Compiler at Groq said, “The complexity of computing platforms is permeating into user code and slowing down innovation. Groq is reversing this trend. Since we’re working on models that were trained on Nvidia GPUs, the first step of porting customer workloads to Groq is removing non-portable, vendor-specific code targeted for specific vendors and architectures. This might include replacing vendor-specific code calling kernels, removing manual parallelism or memory semantics, etc. The resulting code ends up looking a lot simpler and more elegant. Imagine not having to do all that ‘performance engineering’ in the first place to achieve stellar performance. This also helps by not locking a business down to a specific vendor.”
If you would like to discuss your AI strategy and solutions with a technology expert at Groq, please reach out to [email protected]. For press inquiries about this story or Groq technology please contact [email protected].
About Groq
Groq is a technology company delivering ultra-low latency performance and record-breaking inference results for the next era of compute in AI, ML, and HPC. Read our latest customer news in cybersecurity, pharma, and finance.
Source: Groq
March 30, 2023
- Zenoss Launches Free Trial for Kubernetes Monitoring with No Download Required
- Hightouch and Fivetran Accelerate Modern Data Stack Adoption, Exceed 230 Shared Customers
- UNESCO Calls on All Governments to Implement AI Global Ethical Framework Without Delay
- Salesforce Announces New Automotive Cloud Features
- mParticle Launches Warehouse Sync
- Dremio and Domo Announce New Integration to Expand Data Lakehouse Access
- Algolia Introduces New Developer-Friendly ‘Build’ Pricing Plan
- MariaDB’s New SkySQL Release Reimagines How Companies Control Cloud Database Spend
March 29, 2023
- BigID Prepares Organizations for CPRA Compliance with Automated Data Privacy Suite
- Lenovo: Only 15% of Businesses Considered Data Leaders, as Organizations Strive to Enhance Data Strategies to Keep Up with Competitors
- Quantic Selects Couchbase Capella to Scale Point of Sale Platform
- NetApp’s 2023 Cloud Complexity Report Highlights Shifting Demands of a Multicloud Environment
- Alation Secures 2 New Procurement Contracts to Meet Public Sector’s Demand for Data Intelligence
- IBM Cloud and Wasabi Partner to Power Data Insights Across Hybrid Cloud Environments
March 28, 2023
- Sinequa Integrates ChatGPT with Its Neural Search Engine
- D2iQ Brings Cloud-native Deployment Capabilities to Public Sector with New DKP Gov Kubernetes Management Platform
- New Academic Program Lowers Cost for University Researchers to Access Leading-edge ML Method
- Virtana Releases Latest State of Multi-Cloud Management Report
- Hitachi Vantara and Golden Grove Nursery Use Data-driven Analytics for More Sustainable Water Management
- Datametica Brings Pelican SaaS-based Tech to the Google Cloud Marketplace
Most Read Features
- Databricks Bucks the Herd with Dolly, a Slim New LLM You Can Train Yourself
- Prompt Engineer: The Next Hot Job in AI
- Data Mesh Vs. Data Fabric: Understanding the Differences
- Iceberg Data Services Emerge from Tabular, Dremio
- Hallucinations, Plagiarism, and ChatGPT
- GPT-4 Has Arrived: Here’s What to Know
- Open Table Formats Square Off in Lakehouse Data Smackdown
- What Does It Mean for a Data Catalog to Be Powered by a Knowledge Graph?
- Apache Pinot Uncorks Real-Time Data for Ad-Tech Firm
- ChatGPT Brings Ethical AI Questions to the Forefront
- More Features…
Most Read News In Brief
- Observability Primed for a Breakout 2023: Prediction
- Mathematica Helps Crack Zodiac Killer’s Code
- Multi-modal GPT-4 Rumored To Be Released This Week
- Bill Gates Says the Age of AI Has Begun, Bringing Opportunity and Responsibility
- Open Letter Urges Pause on AI Research
- OpenXLA Delivers Flexibility for ML Apps
- Observability Overload: Grafana Labs Survey Builds a Case for Centralized Solutions
- OpenAI’s New GPT-3.5 Chatbot Can Rhyme like Snoop Dogg
- Tool Providers Up Their Data Integration Games
- Microsoft Puts AI into ERP and CRM
- More News In Brief…
Most Read This Just In
- Colossal-AI Releases Open Source Framework for ChatGPT Replication
- Former Cloudera CPO & Hortonworks Cofounder, Arun Murthy, Joins Scale AI
- Salesforce Launches Hyperforce EU Operating Zone
- AWS Announces General Availability of Amazon OpenSearch Serverless
- Akkio Launches Chat Explore Powered by GPT-4
- Salesforce Announces Einstein GPT
- Esri Releases New App to Easily View and Analyze Global Land-Cover Changes
- Vultr Announces Availability of NVIDIA H100 Tensor Core GPU and Partnerships with Domino Data Lab and Anaconda
- Google Cloud and Accenture Expand Strategic Partnership, Announce Platform Tech Integration
- Comet Releases MLOps Industry Report | 2023 Machine Learning Practitioner Survey
- More This Just In…
Sponsored Partner Content
Sponsored Whitepapers
Sponsored Multimedia
Contributors
Featured Events
-
AI in Finance Summit NY
April 20 - April 21New York NY United States -
CDAO Spring 2023
April 25 @ 8:00 am - April 26 @ 5:00 pmSan Francisco CA United States -
AI & Big Data Expo North America 2023
May 17 @ 8:00 am - May 18 @ 5:00 pm -
IEEE Conference on Artificial Intelligence 2023
June 5 @ 8:00 am - June 6 @ 5:00 pmSanta Clara CA United States -
CDAO Insurance 2023
June 13 - June 14