资讯

In the complex mathematical task benchmark tests, researchers calculated K2 Think's average scores in AIME24, AIME25, HMMT25, ...
ERNIE-4.5-21B-A3B-Thinking is available now on Hugging Face under an enterprise-friendly Apache 2.0 license — allowing for commercial usage — and is specifically optimized for advanced reasoning, tool ...
Baidu's Chief Technology Officer, Wang Haifeng, announced the official launch of the X1.1 deep thinking model, which is based ...
They found that when the tasks were not in the training data, the language model failed to achieve those tasks correctly ...
It’s a looming challenge for homeland security as we race to integrate artificial intelligence into command, control, and ...
Dr. James McCaffrey presents a complete end-to-end demonstration of the kernel ridge regression technique to predict a single ...
Deep learning (DL) model training must address the memory bottleneck to continue scaling. Processing-in-memory approaches can be a viable solution as they move computations near or into the memory, ...
Official implementation of paper: SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training - LeslieTrue/SFTvsRL ...
Fault diagnosis with formal languages can be performed in an interpretable way. However, the traditional formal languages cannot deal with noisy environments. Additionally, finding the optimal formal ...
As AI becomes a ubiquitous part of everyday life, people increasingly understand how it works. Whereas traditional computer ...
SimuMax is a distributed training simulator designed for large-scale language model (LLM) workloads. It leverages a static analytical model to simulate and analyze both performance and memory usage, ...