• Landing Page
  • Shop
  • Contact
  • Buy JNews
  • Login
Upgrade
The News Porter
Advertisement
  • Home
  • Page One
  • Exclusive
  • Nation
  • World
  • Fast News
  • Business
  • Entertainment
  • More
    • Education
    • Diaspora
    • Health
    • Legal Angle
    • Science & Tech
    • Press Release
    • The Blog Spot
No Result
View All Result
  • Home
  • Page One
  • Exclusive
  • Nation
  • World
  • Fast News
  • Business
  • Entertainment
  • More
    • Education
    • Diaspora
    • Health
    • Legal Angle
    • Science & Tech
    • Press Release
    • The Blog Spot
No Result
View All Result
The News Porter
No Result
View All Result

DeepSeek may be bad news for some AI companies but great for AI research community

How a small Chinese AI company is shaking up US tech heavyweights

NP Team by NP Team
January 30, 2025
in Fast News, Science & Tech
0
DeepSeek may be bad news for some AI companies but great for AI research community
By Tongliang Liu

Chinese artificial intelligence (AI) company DeepSeek has sent shockwaves through the tech community, with the release of extremely efficient AI models that can compete with cutting-edge products from US companies such as OpenAI and Anthropic.

Founded in 2023, DeepSeek has achieved its results with a fraction of the cash and computing power of its competitors.

DeepSeek’s “reasoning” R1 model, released last week, provoked excitement among researchers, shock among investors, and responses from AI heavyweights. The company followed up on January 28 with a model that can work with images as well as text.

So what has DeepSeek done, and how did it do it?

What DeepSeek did

In December, DeepSeek released its V3 model. This is a very powerful “standard” large language model that performs at a similar level to OpenAI’s GPT-4o and Anthropic’s Claude 3.5.

While these models are prone to errors and sometimes make up their own facts, they can carry out tasks such as answering questions, writing essays and generating computer code. On some tests of problem-solving and mathematical reasoning, they score better than the average human.

V3 was trained at a reported cost of about US$5.58 million. This is dramatically cheaper than GPT-4, for example, which cost more than US$100 million to develop.

DeepSeek also claims to have trained V3 using around 2,000 specialised computer chips, specifically H800 GPUs made by NVIDIA. This is again much fewer than other companies, which may have used up to 16,000 of the more powerful H100 chips.

On January 20, DeepSeek released another model, called R1. This is a so-called “reasoning” model, which tries to work through complex problems step by step. These models seem to be better at many tasks that require context and have multiple interrelated parts, such as reading comprehension and strategic planning.

The R1 model is a tweaked version of V3, modified with a technique called reinforcement learning. R1 appears to work at a similar level to OpenAI’s o1, released last year.

DeepSeek also used the same technique to make “reasoning” versions of small open-source models that can run on home computers.

This release has sparked a huge surge of interest in DeepSeek, driving up the popularity of its V3-powered chatbot app and triggering a massive price crash in tech stocks as investors re-evaluate the AI industry. At the time of writing, chipmaker NVIDIA has lost around US$600 billion in value.

How DeepSeek did it

DeepSeek’s breakthroughs have been in achieving greater efficiency: getting good results with fewer resources. In particular, DeepSeek’s developers have pioneered two techniques that may be adopted by AI researchers more broadly.

The first has to do with a mathematical idea called “sparsity”. AI models have a lot of parameters that determine their responses to inputs (V3 has around 671 billion), but only a small fraction of these parameters is used for any given input.

However, predicting which parameters will be needed isn’t easy. DeepSeek used a new technique to do this, and then trained only those parameters. As a result, its models needed far less training than a conventional approach.

The other trick has to do with how V3 stores information in computer memory. DeepSeek has found a clever way to compress the relevant data, so it is easier to store and access quickly.

What it means

DeepSeek’s models and techniques have been released under the free MIT License, which means anyone can download and modify them.

While this may be bad news for some AI companies – whose profits might be eroded by the existence of freely available, powerful models – it is great news for the broader AI research community.

At present, a lot of AI research requires access to enormous amounts of computing resources. Researchers like myself who are based at universities (or anywhere except large tech companies) have had limited ability to carry out tests and experiments.

More efficient models and techniques change the situation. Experimentation and development may now be significantly easier for us.

For consumers, access to AI may also become cheaper. More AI models may be run on users’ own devices, such as laptops or phones, rather than running “in the cloud” for a subscription fee.

For researchers who already have a lot of resources, more efficiency may have less of an effect. It is unclear whether DeepSeek’s approach will help to make models with better performance overall, or simply models that are more efficient.


The author is an Associate Professor of Machine Learning and Director of the Sydney AI Centre, University of Sydney. This story has been used from The Conversation and The NewsPorter bears no responsibility for the content. (Picture: Wikpedia Commons)
Tags: AnthropicChinese AI company DeepSeekGoogleMetaMicrosoftNVIDIAOpenAIX
Previous Post

Will the Janata Dal (United) remain united until the Assembly elections?

Next Post

Of DeepSeek’s deep impact and AI update for everyone

NP Team

NP Team

Related Posts

Social Justice Set to Dominate Bihar’s Political Battleground 2025
Exclusive

Social Justice Set to Dominate Bihar’s Political Battleground 2025

by Dheeraj Sinha
June 2, 2025
Earth heads for 2.7°C of warming by 2100 — a level that poses an unprecedented threat to life on the planet
Environment

Earth heads for 2.7°C of warming by 2100 — a level that poses an unprecedented threat to life on the planet

by NP Team
May 30, 2025
Florence Nightingale’s story retold: New novel reveals the human side of the nursing pioneer
Books

Florence Nightingale’s story retold: New novel reveals the human side of the nursing pioneer

by NP Team
May 29, 2025
International students are a boon to the US at home and abroad. Trump v Harvard will damage US’s reputation globally
Fast News

International students are a boon to the US at home and abroad. Trump v Harvard will damage US’s reputation globally

by NP Team
May 28, 2025
Tej Pratap in the spotlight—once again, for all the wrong reasons. And no one is shedding tears for him
Exclusive

Tej Pratap in the spotlight—once again, for all the wrong reasons. And no one is shedding tears for him

by Abhay Kumar
May 26, 2025
Next Post
Of DeepSeek’s deep impact and AI update for everyone

Of DeepSeek’s deep impact and AI update for everyone

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

News Categories

  • Art & Culture
  • Blogger's
  • Books
  • Business
  • Cities
  • Diaspora
  • Education
  • Entertainment
  • Environment
  • Exclusive
  • Fast News
  • Foto Feature
  • Globetrotter
  • Health
  • History
  • Interviews
  • Latest News
  • Legal Angle
  • Lifestyle
  • Nation
  • National Panorama
  • Op-Ed
  • Page One
  • Photo of the Day
  • Politics
  • Premium Content
  • Press Release
  • Science & Tech
  • Sports
  • The Blog Spot
  • The Wisdom Tree
  • Travel
  • Trending
  • Uncategorized
  • World
The News Porter

We are a small group of media professionals with rich and diverse experience in Print, TV, and Digital, in
India and abroad.

Quick Links

  • About Us
  • Contact Us
  • Our Team
  • Advertise with us
  • Sponsored Content

Tags

Art & Culture Blogger's Books Business Cities Diaspora Education Entertainment Environment Exclusive Fast News Foto Feature Globetrotter Health History Interviews Latest News Legal Angle Lifestyle Nation National Panorama Op-Ed Page One Photo of the Day Politics Premium Content Press Release Science & Tech Sports The Blog Spot The Wisdom Tree Travel Trending Uncategorized World

Recent Posts

  • Social Justice Set to Dominate Bihar’s Political Battleground 2025
  • Bangkok’s SILQ Hotel & Residence: Tranquil, with the cosy, well-laid out Benjasri Park nearby, and yet vibrant and bustling
  • Earth heads for 2.7°C of warming by 2100 — a level that poses an unprecedented threat to life on the planet
  • Florence Nightingale’s story retold: New novel reveals the human side of the nursing pioneer
  • International students are a boon to the US at home and abroad. Trump v Harvard will damage US’s reputation globally

Copyright 2021 - The News Porter © All rights reserved.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • Home
  • Page One
  • Exclusive
  • Nation
  • World
  • Fast News
  • Business
  • Entertainment
  • More
    • Education
    • Diaspora
    • Health
    • Legal Angle
    • Science & Tech
    • Press Release
    • The Blog Spot

Copyright 2021 - The News Porter © All rights reserved.