Blog Post

Prmagazine > News > News > Deep Cogito emerges from stealth with hybrid AI ‘reasoning’ models | TechCrunch
Deep Cogito emerges from stealth with hybrid AI ‘reasoning’ models | TechCrunch

Deep Cogito emerges from stealth with hybrid AI ‘reasoning’ models | TechCrunch

A new company Deep CogitoFrom invisibility, a publicly available AI model emerges that can switch between “reasoning” and non-disputed modes.

Inference models such as Openai O1 As they are able to effectively examine their abilities by gradually solving complex problems, they show great hope in fields such as mathematics and physics. However, this reasoning comes at a cost: higher calculations and delays. This is the reason A laboratory like a human A “hybrid” model architecture is being pursued, combining inference components with standard non-conditioning elements. Mixed models can quickly answer simple questions while spending more time thinking about more challenging queries.

All Deep Cogito models, called Cogito 1, are hybrid models. Cogito claims they outperform the best open models of the same size, including Meta and Chinese AI-launched models DeepSeek.

“Each model can answer directly […] Or self-reflection (such as inference models) before answering,” the company Explain in blog post. “[All] Developed by a small team in about 75 days. ”

The Cogito 1 model ranges from 3 billion parameters to 70 billion parameters, and Cogito says models with up to 671 billion parameters will join them in the next few weeks and months. Parameters roughly correspond to the model’s problem-solving skills, where more parameters are usually better.

It can be clearly seen that Cogito 1 was not developed from scratch. Deep Cogito builds on Meta’s open llama and Alibaba’s Qwen model to create its own. The company said it adopts novel training methods to improve the performance of the base model and enable switchable reasoning.

According to the results of Cogito’s internal benchmarks, the largest Cogito 1 model, Cogito 70B, reasoning outperforms DeepSeek’s R1 inference model in some mathematical and linguistic evaluations. Cogito 70b has reason to disability and also releases the Llama 4 Scout model on LiveBench, a universal AI-tested one.

Each Cogito 1 model is available for download or use with the API in cloud provider AI and AI.

Deep Cogito
Compared with other popular public AI models, Cogito 1’s performance,Image source:Deep Cogito

“At present, we are still in [our] “The scaling curves are usually used with a small portion of the calculations, often used in traditional large language models/continuous training,” Cogito wrote in a blog post.

According to documents submitted to CaliforniaSan Francisco-based Deep Cogito was founded in June 2024. The company’s LinkedIn Page Two co-founders are listed, namely Drishan Arora and Dhruv Malhotra. Malhotra was formerly a product manager at Google AI Lab DeepMind, where he worked in generative search technology. Arora is a senior software engineer at Google.

Supporters of Deep Cogito include South Park Commons, According to tonethe ambitious goal is to build “general super intelligence”. The company’s founders understand the phrase means AI, which can perform tasks better than most humans, and “reveals new features we haven’t imagined yet.”

Source link

Leave a comment

Your email address will not be published. Required fields are marked *

star360feedback