deepseek large language models

2d on MSN

Large Language Models Pose Growing Security Risks

More powerful and pervasive large language models are creating a new cybersecurity challenge for companies. The risks posed ...

How DeepSeek used distillation to train its artificial intelligence model, and what it means for companies such as OpenAI

DeepSeek didn't invent distillation, but it woke up the AI world to its disruptive potential. It also ushered in the rise of ...

China's ports adopt DeepSeek AI model to streamline operations, protect data

China's home-grown technological innovations - like the DeepSeek large language model that has taken the artificial ...

1d on MSN

DeepSeek: Geopolitical, Technological & a Layman’s view of big AI onset

DeepSeek’s models rely on a process called distillation, i.e., using foundational models like Llama to train a smaller, more ...

The Economist, 11d

Forget DeepSeek. Large language models are getting cheaper still

As recently as 2022, just building a large language model (LLM) was a feat at the cutting ... In December a Chinese firm, DeepSeek, earned itself headlines for cutting the dollar cost of training ...

Inverse, 9d

DeepSeek Has Upended The World Of AI In Ways That We’re Only Beginning to Understand

DeepSeek’s disruptive debut comes down ... Developing such powerful AI systems begins with building a large language model. A large language model predicts the next word given previous words.

3d on MSN

Baidu's CEO credits DeepSeek for the push to open-source its own AI model

"One thing we learned from DeepSeek is that open-sourcing the best models can greatly help adoption," said Baidu's Robin Li.

Kava AI Launches the World’s Largest Decentralized DeepSeek Model - Finally Making Web3 Effortless

A groundbreaking deAI solution merges best-in-class open-source intelligence with frictionless on-chain transactions, unleashing a new era of user-centric crypto ...

Baidu to open-source its Ernie large language model series

The company, the operator of China’s most popular search engine, announced the plan today. Reuters reported that the ...

Together AI’s $305M bet: Reasoning models like DeepSeek-R1 are increasing, not decreasing, GPU demand

The demands of DeepSeek's advanced reasoning capabilities are pushing enterprises toward Together AI's optimized infrastructure platform.

How test-time scaling unlocks hidden reasoning abilities in small language models (and allows them to outperform LLMs)

A 1B small language model can beat a 405B large language model in reasoning tasks if provided with the right test-time scaling strategy.

AI start-up DeepSeek to open-source 5 code repositories next week for 'full transparency'

DeepSeek said it would double down on open-source technology with a fresh commitment to make five of its code repositories public, as the Chinese start-up continues to draw worldwide attention amid ...
