Follow Datanami:

Tag: BERT

Are We Nearing the End of ML Modeling?

Josh Tobin, the co-founder and CEO of machine learning tool provider Gantry, didn’t want to believe it at first. But Tobin, who previously worked as a research scientist at OpenAI, eventually came to the conclusion tha Read more…

Large Language Models in 2023: Worth the Hype?

ChatGPT has caught the world’s attention. There’s no denying that. But do large language models, of which ChatGPT is a member, have the staying power to be a transformative force for business in 2023? We once again t Read more…

The Drawbacks of ChatGPT for Production Conversational AI Systems

With its detailed and human-like written responses, ChatGPT has caught the world’s attention and spawned a meaningful discussion about how people should interact with this form of AI. ChatGPT is an upgrade in many ways Read more…

AI Is Coming for White-Collar Jobs, Too

Think AI is just coming for customer service jobs? Think again, say AI experts, who point to recent advances in large language models as evidence that white-collar and professional jobs will be disrupted too. Figuring ou Read more…

Conversational AI Poised to Be Major Disrupter

Chatbots and conversational AI systems got an extended tryout during COVID as companies scrambled for ways to keep their operations running amid lockdowns. The technology fared better than expected, and now is on the cus Read more…

AI Is Not Sentient Yet. But That Doesn’t Mean It’s Not Useful in the Enterprise

Have large language models finally crossed the chasm and become self-aware? A Google researcher recently shocked the world by declaring that Google’s LaMDA has become sentient. Others in the business disagree, saying w Read more…

d-Matrix Gets Funding to Build SRAM ‘Chiplets’ for AI Inference

Hardware startup d-Matrix says the $44 million it raised in a Series A round today will help it continue development of a novel “chiplet” architecture that uses 6 nanometer chip embedded in SRAM memory modules for ac Read more…

Cerebras Hits the Accelerator for Deep Learning Workloads

When it comes to large neural networks like BERT or GPT-3, organizations often must wait weeks or even months for a training task to complete if they’re using traditional CPU and GPU clusters. But with its massive Wafe Read more…

An All-Volunteer Deep Learning Army

A team of researchers says their novel framework, called Distributed Deep Learning in Open Collaborations, or DeDLOC, brings the potential to train large deep learning models from scratch in a distributed, grid-like mann Read more…

FinTech Firm Explores Named Entity Extraction

Founded in 2018, San Francisco-based Digits Financial combines machine learning and analytics to give businesses insights into their transactions, automatically identifying patterns, classifying data, and detecting anoma Read more…

Inside eBay’s Optimization Techniques for Scaling AI

Getting the software right is important when developing machine learning models, such as recommendation or classification systems. But at eBay, optimizing the software to run on a particular piece of hardware using disti Read more…

Unlocking the True Potential of ML: How Self-Supervised Learning in Language Can Beat Human Performance

A core goal for many organizations using artificial intelligence (AI) systems is to have them mirror human language and intelligence. However, mimicking human language and mastering its unique complexities continues to b Read more…

Nvidia Inference Engine Keeps BERT Latency Within a Millisecond

It’s a shame when your data scientists dial in the accuracy on a deep learning model to a very high degree, only to be forced to gut the model for inference because of resource constraints. But that will seldom be the Read more…

Google’s ‘Breakthrough’ LaMDA Promises to Elevate the Common Chatbot

Many of Google’s language processing efforts – like BERT and, more recently, MUM – are focused on returning search queries. But as Google moves more toward Assistant – and search queries in general become more de Read more…

Google’s ‘MUM’ Search AI Aims to Move Beyond Simple Answers

Google’s current search answers may seem complex compared to a few years ago, but to hear the search giant talk about it, this is just the beginning – and there’s a long, long way to go. Now, Google is introducing Read more…

Experts Disagree on the Utility of Large Language Models

Large language models like OpenAI’s GPT-3 and Google Brain’s Switch Transformer have caught the eye of AI experts, who have expressed surprise at the rapid pace of improvement. However, not everybody is jumping onto the bandwagon, and others see significant limitations in the new technology, as well as ethical implications. Read more…

Baidu Releases PaddlePaddle Upgrades

An updated release of Baidu’s deep learning framework includes a batch of new features ranging from inference capabilities for Internet of Things (IoT) applications to a natural language processing (NLP) framework for Read more…

Datanami