Author: Sixpences
-
Chinese AI Giants DeepSeek and Alibaba Release Powerful New Open-Source Models
In a significant development for the open-source AI community, Chinese AI companies DeepSeek and Alibaba quietly released major model updates within hours of each other, both demonstrating impressive capabilities that…
-
Chinese AI Agent Manus Makes Splash, Sparks Both Wonder and Controversy
A new AI agent called Manus has emerged from China in recent days, claiming to be the world’s first general-purpose AI agent. The product has quickly gone viral across Chinese…
-
Alibaba’s QwQ-32B: DeepSeek R1 Performance with 1/21 of the Parameters
In a significant advancement for AI efficiency, Alibaba’s Qwen team has open-sourced QwQ-32B, a large language model that achieves comparable performance to much larger models while dramatically reducing computational costs.…
-
Is DeepSeek’s 545% Profit Margin a Game-Changer for AI Computing?
DeepSeek made waves in the AI industry last week with its “five consecutive bombshells” during open-source week, culminating in a shocking revelation: a theoretical profit margin of 545%, with their…
-
DeepSeek Unveils DeepGEMM: 300-Line Code Powers V3 and R1 Models
In its third open-source release of the week, DeepSeek-AI has launched DeepGEMM, an innovative FP8 General Matrix Multiplication (GEMM) acceleration library designed for maximum performance with minimal code complexity. This…
-
DeepSeek Accelerates Release of New AI Model R2, Targeting OpenAI’s o3
Reuters reports that Chinese AI company DeepSeek is accelerating the release of its new R2 model, potentially launching before its originally planned May debut. The R2 model is expected to…
-
DeepSeek Releases DeepEP: An Efficient Communication Library for MoE Models
February 25, 2025: DeepSeek has released DeepEP, the second open-source project in its “Open Source Week” initiative. This new library provides an efficient expert-parallel communication system designed specifically for Mixture-of-Experts…
-
DeepSeek Launches Open Source Week with FlashMLA Release
DeepSeek initiated its “Open Source Week” on February 24, 2025, by open-sourcing FlashMLA, an efficient MLA decoding kernel optimized for NVIDIA’s Hopper GPUs. This release marks the beginning of the…
-
Alibaba Announces Record-Breaking $52 Billion Investment in Cloud and AI Infrastructure
Alibaba Group has unveiled an unprecedented investment plan, committing 380 billion yuan ($52 billion) over the next three years to build cloud computing and artificial intelligence infrastructure. This investment surpasses…
-
Major Banks Raise Alibaba’s Target Price Amid AI-Driven Growth
Multiple leading investment banks have significantly raised their target prices for Alibaba Group, reflecting growing confidence in the company’s AI and cloud computing strategy. Goldman Sachs has increased its 12-month…