AI EXPRESS

Open source NLP is fueling a new wave of startups

by seprameen
December 23, 2021
in AI

Large language models capable of writing poems, summaries, and computer code are driving the demand for “natural language processing (NLP) as a service.” As these models become more capable (and, relatively speaking, more accessible), appetite for them in the enterprise is growing. According to a 2021 survey from John Snow Labs and Gradient Flow, 60% of tech leaders indicated that their NLP budgets grew by at least 10% compared to 2020, while a third (33%) said that their spending climbed by more than 30%.

Well-resourced providers like OpenAI, Cohere, and AI21 Labs are reaping the benefits. As of March, OpenAI said that GPT-3 was being used in more than 300 different apps by “tens of thousands” of developers and producing 4.5 billion words per day. Historically, training and deploying these models was beyond the reach of startups without substantial capital, not to mention compute resources. But the emergence of open source NLP models, datasets, and infrastructure is democratizing the technology in surprising ways.

Open source NLP

The hurdles to developing a state-of-the-art language model are significant. Those with the resources to develop and train them, like OpenAI, often choose not to open-source their systems in favor of commercializing them (or exclusively licensing them). But even the models that are open-sourced require immense compute resources to commercialize.

Take, for example, Megatron 530B, which was jointly created and released by Microsoft and Nvidia. The model was originally trained across 560 Nvidia DGX A100 servers, each hosting 8 Nvidia A100 80GB GPUs. Microsoft and Nvidia say that they observed between 113 and 126 teraflops per GPU while training Megatron 530B, which would put the training cost in the millions of dollars. (A teraflop rating measures the performance of hardware, including GPUs.)
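The cluster figures above imply a rough aggregate throughput. The following is a minimal back-of-the-envelope sketch in Python that uses only the numbers quoted in this article (560 servers, 8 GPUs each, 113 to 126 teraflops per GPU). The function and constant names are illustrative, and the result describes sustained training throughput, not a cost estimate.

```python
# Back-of-the-envelope arithmetic from the figures quoted in the article.
SERVERS = 560                 # Nvidia DGX A100 servers
GPUS_PER_SERVER = 8           # A100 80GB GPUs per server
TFLOPS_PER_GPU = (113, 126)   # observed range per GPU during training


def total_gpus(servers: int = SERVERS, gpus_per_server: int = GPUS_PER_SERVER) -> int:
    """Number of GPUs in the training cluster."""
    return servers * gpus_per_server


def aggregate_petaflops(tflops_per_gpu: float, gpus: int) -> float:
    """Aggregate throughput in petaflops (1 petaflop = 1,000 teraflops)."""
    return tflops_per_gpu * gpus / 1000


if __name__ == "__main__":
    gpus = total_gpus()
    low, high = (aggregate_petaflops(t, gpus) for t in TFLOPS_PER_GPU)
    print(f"{gpus} GPUs, roughly {low:.0f} to {high:.0f} petaflops in aggregate")
```

That works out to 4,480 GPUs and roughly 506 to 564 petaflops of sustained throughput across the cluster, which helps explain why training at this scale is out of reach for most startups.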

Inference, meaning actually running the trained model, is another challenge. Getting inferencing (e.g., sentence autocompletion) time with Megatron 530B down to half a second requires the equivalent of two $199,000 Nvidia DGX A100 systems. While cloud alternatives might be cheaper, they’re not dramatically so: one estimate pegs the cost of running GPT-3 on a single Amazon Web Services instance at a minimum of $87,000 per year.

Recently, however, open research efforts like EleutherAI have lowered the barriers to entry. A grassroots collection of AI researchers, EleutherAI aims to eventually deliver the code and datasets needed to run a model similar (though not identical) to GPT-3. The group has already released a dataset called The Pile that’s designed to train large language models to complete text, write code, and more. (Incidentally, Megatron 530B was trained on The Pile.) And in June, EleutherAI made available under the Apache 2.0 license GPT-Neo and its successor, GPT-J, a language model that performs nearly on par with an equivalent-sized GPT-3 model.

One of the startups serving EleutherAI’s models as a service is NLP Cloud, which was founded a year ago by Julien Salinas, a former software engineer at Hunter.io and the founder of money-lending service StudyLink.fr. Salinas says the idea came to him when he realized that, as a programmer, it was becoming easier and easier to leverage open source NLP models for business applications but harder to get them to run properly in production.

Above: NLP Cloud’s model dashboard.

Image Credit: NLP Cloud

NLP Cloud, which has five employees, hasn’t raised money from outside investors, but claims to be profitable.

“Our customer base is growing quickly, and we see very diverse customers using NLP Cloud, from freelancers to startups and bigger tech companies,” Salinas told VentureBeat via email. “For example, we’re currently helping a customer create a programming expert AI that doesn’t code for you, but, even more importantly, gives you advanced information about specific technical fields that you can leverage when developing your application (e.g., as a Go developer, you might want to learn how to use goroutines). We have another customer who fine-tuned his own version of GPT-J on NLP Cloud in order to make medical summaries of conversations between doctors and patients.”

NLP Cloud competes with Neuro, which serves models, including EleutherAI’s GPT-J, via an API on a pay-per-use basis. Pursuing greater efficiency, Neuro says it runs a lighter-weight version of GPT-J that still produces “strong results” for applications like generating marketing copy. In another cost-saving measure, Neuro also has customers share cloud GPUs, the power consumption of which the company caps below a certain level.

“Customer growth has been good. We’ve had many users put us into their production environment without having spoken with them, which is amazing for an enterprise product,” CEO Paul Hetherington told VentureBeat via email. “Some people have spent over $1,000 in their first day of usage with integration times of minutes in many instances. We have customers using GPT-J … in a variety of ways, including market copy, generating stories and articles, and generating dialogue for characters in games or chatbots.”

Neuro, which claims to run all of its compute in-house, has an 11-person team and recently graduated from Y Combinator’s Winter 2021 cohort. Hetherington says that the plan is to continue to build out its cloud network and to grow its relationship with EleutherAI.

Another EleutherAI model adopter is CoreWeave, which also works closely with EleutherAI to train the group’s larger models. CoreWeave, a cloud service provider that initially focused on cryptocurrency mining, says that serving NLP models is its “largest use case thus far” and currently works with customers including Novel AI, whose AI-powered platform helps users create stories and embark on text-based adventures.

“We’ve leaned into NLP because of the size of the market and the void we fill as a cloud provider,” CoreWeave cofounder and CTO Brian Venturo told VentureBeat via email. “I think we’ve been really successful here because of the infrastructure we built, and the cost advantages our clients see on CoreWeave compared to competitors.”

Bias issues

No language model is immune to bias and toxicity, as research has repeatedly shown. Larger NLP-as-a-service providers have taken a range of approaches in attempting to mitigate the effects, from consulting outside advisory councils to implementing filters that prevent customers from using the models to generate certain content, like that pertaining to self-harm.

At the dataset level, EleutherAI claims to have performed “extensive bias analysis” on The Pile and made “tough editorial decisions” to exclude data that they felt were “unacceptably negatively biased” toward certain groups or views.

NLP Cloud allows customers to upload a blacklist of words to reduce the risk of generating offending content with its hosted models. In an effort to preserve the integrity of the original models, flaws and all, the company hasn’t deployed filters or attempted to detoxify any of the models it serves. But Salinas says that if NLP Cloud does make modifications in the future, it’ll be transparent about the fact that it has done so.

“The most important risk of toxicity comes from GPT-J as it is a powerful AI model for text generation, so it should be used responsibly,” Salinas said.
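To illustrate the general idea behind a word blacklist like the one described above, here is a minimal Python sketch that checks generated text against a customer-supplied list of terms. It is not NLP Cloud’s implementation, which the article does not describe; the function names and the whole-word, case-insensitive matching rule are assumptions chosen for illustration.

```python
import re
from typing import Iterable, Optional, Pattern


def build_blocklist_pattern(terms: Iterable[str]) -> Optional[Pattern[str]]:
    """Compile blocked terms into one case-insensitive, whole-word regex.

    Returns None when the list is empty, so nothing is ever blocked.
    """
    # Escape each term so it is matched literally; longer terms come first
    # so multi-word phrases win over their shorter prefixes.
    escaped = sorted((re.escape(t) for t in terms if t), key=len, reverse=True)
    if not escaped:
        return None
    return re.compile(r"\b(?:" + "|".join(escaped) + r")\b", re.IGNORECASE)


def violates_blocklist(text: str, pattern: Optional[Pattern[str]]) -> bool:
    """Return True if the text contains any blocked term as a whole word."""
    return pattern is not None and pattern.search(text) is not None


if __name__ == "__main__":
    # Hypothetical blocked terms, for demonstration only.
    pattern = build_blocklist_pattern(["foo", "bar baz"])
    for sample in ["A Foo appears here", "food is fine", "bar baz!"]:
        print(sample, "->", violates_blocklist(sample, pattern))
```

A caller could discard or regenerate any output for which `violates_blocklist` returns `True`. Simple word matching of this kind only catches listed terms; it does not address subtler bias in model output.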

Neither NLP Cloud nor Neuro explicitly prohibits customers from using models for potentially problematic use cases, although both reserve the right to revoke access to the models for any reason. CoreWeave, for its part, believes that not policing its customers’ applications is a selling point of its service, but advocates for general “AI safety.”

“[O]ur clients fine-tune models [to, for instance, reduce toxicity] regularly. This empowers them to ‘re-train’ large language models on a relatively small data set to make the model more relevant to their use case,” Venturo continued. “We don’t currently have an out-of-the-box solution for clients to do this, but I’d expect that to change in the coming weeks.”

Hetherington notes that Neuro also offers fine-tuning capabilities “with little-to-no programming expertise required.”

The path forward

While the hands-off approach to model moderation might not sit well with every customer, startups like NLP Cloud, Neuro, and CoreWeave argue that they’re making NLP technology more accessible than their better-funded rivals.

For example, on NLP Cloud, the plan for three requests per minute using GPT-J costs $29 per month on a cloud CPU or $99 per month on a GPU, regardless of the number of tokens (roughly, words or word fragments). By contrast, OpenAI charges on a per-token basis. Towards Data Science compared OpenAI’s and NLP Cloud’s offerings and found that a customer offering an essay-generating app that receives 10 requests every minute would have to pay around $2,850 per month if they used one of OpenAI’s less-capable models (Curie) versus $699 with NLP Cloud.
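The difference between flat-rate and per-token billing can be sketched with a few lines of Python. Only the request rate (10 per minute) and the $699 flat figure come from the comparison cited above; the tokens-per-request and price-per-1,000-tokens values are hypothetical inputs chosen for illustration, not numbers taken from that comparison.

```python
MINUTES_PER_MONTH = 60 * 24 * 30  # assumes a 30-day month of continuous traffic


def requests_per_month(requests_per_minute: float) -> float:
    """Total requests in a 30-day month at a steady per-minute rate."""
    return requests_per_minute * MINUTES_PER_MONTH


def per_token_monthly_cost(
    requests_per_minute: float,
    tokens_per_request: float,
    usd_per_1k_tokens: float,
) -> float:
    """Monthly bill under per-token pricing."""
    total_tokens = requests_per_month(requests_per_minute) * tokens_per_request
    return total_tokens / 1000 * usd_per_1k_tokens


if __name__ == "__main__":
    # Hypothetical inputs (assumptions, not from the cited comparison).
    ASSUMED_TOKENS_PER_REQUEST = 1100
    ASSUMED_USD_PER_1K_TOKENS = 0.006
    FLAT_PLAN_USD = 699  # flat monthly figure quoted in the article

    metered = per_token_monthly_cost(
        10, ASSUMED_TOKENS_PER_REQUEST, ASSUMED_USD_PER_1K_TOKENS
    )
    print(f"Per-token: ${metered:,.0f}/month vs flat: ${FLAT_PLAN_USD}/month")
    print(f"Ratio: {metered / FLAT_PLAN_USD:.1f}x")
```

At 10 requests per minute the app serves 432,000 requests in a 30-day month, so a per-token bill scales directly with how long each request and response is, while a flat plan does not. With the assumed inputs the metered bill comes to about $2,851 per month, roughly four times the flat figure.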

Startups built on open source models like EleutherAI’s could drive the next wave of NLP adoption. Advisory firm Mordor Intelligence forecasts that the NLP market will more than triple its revenue by 2025, as business interest in AI rises.

“Deploying these models efficiently so we can maintain an affordable pricing, while making them reliable without any interruption, is a challenge. [But the goal is to provide] a way for developers and data scientists to take advantage of NLP in production without worrying about DevOps,” Salinas said.

© 2021 Aiexpress.io - All rights reserved.
