AI EXPRESS - Hot Deal 4 VCs instabooks.co

AI Weekly: Novel architectures could make large language models more scalable

by seprameen
December 19, 2021
in AI


Beginning in earnest with OpenAI’s GPT-3, the focus in the field of natural language processing has turned to large language models (LLMs). LLMs, characterized by the amount of data, compute, and storage required to develop them, are capable of impressive feats of language understanding, like generating code and writing rhyming poetry. But as an increasing number of studies point out, LLMs are impractically large for most researchers and organizations to take advantage of. Not only that, but they consume an amount of power that calls into question whether they’re sustainable to use over the long run.

New research suggests that this needn’t be the case forever, though. In a recent paper, Google introduced the Generalist Language Model (GLaM), which the company claims is one of the most efficient LLMs of its size and type. Although GLaM contains 1.2 trillion parameters, nearly seven times the number in GPT-3 (175 billion), Google says that it improves across popular language benchmarks while using “significantly” less computation during inference.

“Our large-scale … language model, GLaM, achieves competitive results on zero-shot and one-shot learning and is a more efficient model than prior monolithic dense counterparts,” the Google researchers behind GLaM wrote in a blog post. “We hope that our work will spark more research into compute-efficient language models.”

Sparsity vs. density

In machine learning, parameters are the part of the model that’s learned from historical training data. Generally speaking, in the language domain, the correlation between the number of parameters and sophistication has held up remarkably well. DeepMind’s recently detailed Gopher model has 280 billion parameters, while Microsoft’s and Nvidia’s Megatron 530B boasts 530 billion. Both are among the top performers, if not the top performers, on key natural language benchmark tasks, including text generation.

But training a model like Megatron 530B requires hundreds of GPU- or accelerator-equipped servers and millions of dollars. It’s also bad for the environment. GPT-3 alone used 1,287 megawatt-hours (MWh) during training and produced 552 metric tons of carbon dioxide emissions, a Google study found. That’s roughly equivalent to the yearly emissions of 58 homes in the U.S.

What makes GLaM different from most LLMs to date is its “mixture of experts” (MoE) architecture. An MoE can be thought of as having different layers of “submodels,” or experts, specialized for different text. The experts in each layer are controlled by a “gating” component that taps the experts based on the text. For a given word or part of a word, the gating component selects the two most appropriate experts to process the word or word part and make a prediction (e.g., generate text).
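The routing step described above can be sketched in a few lines of plain Python. This is only an illustration of generic top-2 gating, not Google’s implementation: the toy experts, the hard-coded gate scores, and the choice to renormalize the two winning weights so they sum to 1 are all assumptions made for the example.

```python
import math

def softmax(scores):
    """Turn raw gate scores into weights that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def top2_route(gate_scores):
    """Pick the two highest-weighted experts and renormalize their weights (assumed scheme)."""
    weights = softmax(gate_scores)
    ranked = sorted(range(len(weights)), key=lambda i: weights[i], reverse=True)
    first, second = ranked[0], ranked[1]
    norm = weights[first] + weights[second]
    return [(first, weights[first] / norm), (second, weights[second] / norm)]

def moe_layer(token, experts, gate_scores):
    """Combine the outputs of the two selected experts; all other experts are skipped."""
    return sum(w * experts[i](token) for i, w in top2_route(gate_scores))

# Four toy "experts", each simply scaling its input by a different factor.
experts = [lambda x, k=k: k * x for k in (1.0, 2.0, 3.0, 4.0)]
# Hypothetical gate scores for one token; a real gate would compute these from the token.
scores = [0.1, 2.0, 0.3, 2.0]

routes = top2_route(scores)
output = moe_layer(1.0, experts, scores)
print(routes, output)
```

Only two of the four experts run for this token, which is the source of the inference savings: the model’s total size grows with the number of experts, while the work done per token does not.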

The full version of GLaM has 64 experts per MoE layer with 32 MoE layers in total, but only uses a subnetwork of 97 billion (8% of 1.2 trillion) parameters per word or word part during processing. “Dense” models like GPT-3 use all of their parameters for processing, significantly increasing the computational (and financial) requirements. For example, Nvidia says that processing with Megatron 530B can take over a minute on a CPU-based on-premises server. It takes half a second on two Nvidia-designed DGX systems, but just one of those systems can cost $7 million to $60 million.
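As a quick check on the figures above, the short calculation below works out what share of GLaM is active per word or word part, and how that active subnetwork compares with GPT-3. The variable names are illustrative; the numbers are the ones quoted in this article.

```python
TOTAL_PARAMS = 1.2e12   # GLaM total parameters
ACTIVE_PARAMS = 97e9    # parameters GLaM uses per word or word part
GPT3_PARAMS = 175e9     # GPT-3 is dense, so all parameters are used every time

# Share of GLaM that is actually exercised for a single token.
active_fraction = ACTIVE_PARAMS / TOTAL_PARAMS
# Size of GLaM's active subnetwork relative to the whole of GPT-3.
active_vs_gpt3 = ACTIVE_PARAMS / GPT3_PARAMS

print(f"Active share of GLaM: {active_fraction:.1%}")
print(f"Active GLaM parameters relative to GPT-3: {active_vs_gpt3:.0%}")
```

In other words, although GLaM is far larger than GPT-3 overall, the portion doing work on any one token is a little over half the size of GPT-3.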

GLaM isn’t perfect: it exceeds or is on par with the performance of a dense LLM in between 80% and 90% (but not all) of tasks. And GLaM uses more computation during training, because it trains on a dataset with more words and word parts than most LLMs. (Versus the billions of words from which GPT-3 learned language, GLaM ingested a dataset that was initially over 1.6 trillion words in size.) But Google claims that GLaM uses less than half the energy needed to train GPT-3, at 456 megawatt-hours (MWh) versus 1,287 MWh. For context, a single megawatt is enough to power around 796 homes for a year.
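The energy comparison is easier to read once megawatts (power) and megawatt-hours (energy) are put on the same footing. The rough conversion below takes the 1,287 MWh figure for GPT-3 cited earlier in the article and assumes that “a single megawatt for a year” means 1 MW sustained over a 365-day year, or 8,760 MWh; that reading is an assumption, not something the article states.

```python
GLAM_TRAINING_MWH = 456
GPT3_TRAINING_MWH = 1287      # GPT-3 training energy cited earlier in the article
HOMES_PER_MW_YEAR = 796       # homes powered by 1 MW for a year, per the article
HOURS_PER_YEAR = 365 * 24     # assumption: a non-leap year of continuous supply

# GLaM's training energy as a fraction of GPT-3's.
ratio = GLAM_TRAINING_MWH / GPT3_TRAINING_MWH

# Energy one home uses in a year, implied by the 796-homes figure.
mwh_per_home_year = HOURS_PER_YEAR / HOMES_PER_MW_YEAR

# Training energy expressed as "homes powered for one year".
glam_home_years = GLAM_TRAINING_MWH / mwh_per_home_year
gpt3_home_years = GPT3_TRAINING_MWH / mwh_per_home_year

print(f"GLaM used {ratio:.0%} of GPT-3's training energy")
print(f"GLaM: about {glam_home_years:.0f} home-years; GPT-3: about {gpt3_home_years:.0f} home-years")
```

Under these assumptions, GLaM’s training energy comes to roughly a third of GPT-3’s, comfortably within Google’s “less than half” claim.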

“GLaM is yet another step in the industrialization of large language models. The team applies and refines many modern tweaks and advancements to improve the performance and inference cost of this latest model, and comes away with an impressive feat of engineering,” Connor Leahy, a data scientist at EleutherAI, an open AI research collective, told VentureBeat. “Even if there is nothing scientifically groundbreaking in this latest model iteration, it shows just how much engineering effort companies like Google are throwing behind LLMs.”

Future work

GLaM, which builds on Google’s own Switch Transformer, a trillion-parameter MoE detailed in January, follows on the heels of other techniques to improve the efficiency of LLMs. A separate team of Google researchers has proposed fine-tuned language net (FLAN), a model that bests GPT-3 “by a large margin” on a number of challenging benchmarks despite being smaller (and more energy-efficient). DeepMind claims that another of its language models, Retro, can beat LLMs 25 times its size, thanks to an external memory that allows it to look up passages of text on the fly.

Of course, efficiency is just one hurdle to overcome where LLMs are concerned. Following similar investigations by AI ethicists Timnit Gebru and Margaret Mitchell, among others, DeepMind last week highlighted a few of the problematic tendencies of LLMs, which include perpetuating stereotypes, using toxic language, leaking sensitive information, providing false or misleading information, and performing poorly for minority groups.

Solutions to these problems aren’t immediately forthcoming. But the hope is that architectures like MoE (and perhaps GLaM-like models) will make LLMs more accessible to researchers, enabling them to investigate potential ways to fix, or at the very least mitigate, the worst of the issues.

For AI coverage, send news tips to Kyle Wiggers, and be sure to subscribe to the AI Weekly newsletter and bookmark our AI channel, The Machine.

Thanks for reading,

Kyle Wiggers

AI Staff Writer

© 2021 Aiexpress.io - All rights reserved.
