site:www.marktechpost.com

Microsoft AI Releases Fara-7B: An Efficient Agentic Model for Computer Use

How do we safely let an AI agent handle real web tasks like booking, searching, and form filling directly on our own devices without sending everything to the cloud? Microsoft Research has released ...

marktechpost

NVIDIA AI Releases Nemotron-Elastic-12B: A Single AI Model that Gives You 6B/9B/12B Variants without Extra Training Cost

Most production systems need several model sizes, a larger model for server side workloads, a mid size model for strong edge GPUs, and a smaller model for tight latency or power budgets. The usual ...

Google vs OpenAI vs Anthropic: The Agentic AI Arms Race Breakdown

In this article we will analyze how Google, OpenAI, and Anthropic are productizing ‘agentic’ capabilities across computer-use control, tool/function calling, orchestration, governance, and enterprise ...

Model Context Protocol (MCP) vs Function Calling vs OpenAPI Tools — When to Use Each?

Orchestration: Host routes across many servers/tools | App-local chaining | Agent/toolkit routes intents → operations ...

MLPerf Inference v5.1 (2025): Results Explained for GPUs, CPUs, and AI Accelerators

Decoding MLPerf Inference v5.1 (2025) results: scenarios, TTFT/TPOT, power metrics for GPUs, CPUs, accelerators, datacenter, edge ...

Hugging Face Releases Smol2Operator: A Fully Open-Source Pipeline to Train a 2.2B VLM into an Agentic GUI Coder

Hugging Face (HF) has released Smol2Operator, a reproducible, end-to-end recipe that turns a small vision-language model (VLM) with no prior UI grounding into a GUI-operating, tool-using agent. The ...

IBM AI Releases Granite-Docling-258M: An Open-Source, Enterprise-Ready Document AI Model

IBM has released Granite-Docling-258M, an open-source (Apache-2.0) vision-language model designed specifically for end-to-end document conversion. The model targets layout-faithful extraction—tables, ...

The State of Voice AI in 2025: Trends, Breakthroughs, and Market Leaders

The year 2025 marks a turning point for Voice AI Agents, with technology reaching levels of naturalness, context-awareness, and commercial adoption that were unimaginable a decade ago. Powered by ...

What is Agentic RAG? Use Cases and Top Agentic RAG Tools (2025)

Agentic RAG combines the strengths of traditional RAG—where large language models (LLMs) retrieve and ground outputs in external context—with agentic decision-making and tool use. Unlike static ...

Top 10 AI Blogs and News Websites for AI Developers and Engineers in 2025

Staying current with the latest breakthroughs, tools, and industry shifts is critical for AI developers and engineers. To help you cut through the noise, here’s a curated list of the top 10 AI-focused ...

NVIDIA AI Releases ProRLv2: Advancing Reasoning in Language Models with Extended Reinforcement Learning RL

ProRLv2 is the latest version of NVIDIA’s Prolonged Reinforcement Learning (ProRL), designed specifically to push the boundaries of reasoning in large language models (LLMs). By scaling reinforcement ...

MoE Architecture Comparison: Qwen3 30B-A3B vs. GPT-OSS 20B

Qwen3 incorporates a hybrid reasoning system supporting both “thinking” and “non-thinking” modes, allowing users to control computational overhead based on task complexity. The model implements ...
