Dive into Claude Haiku 4.5: Faster, Smarter, and More Affordable

Author: Nguyễn Thùy Dương
16:58 05/11/2025

After the release of Claude Sonnet 4.5, considered a world-class model for programming and agentic use, Anthropic has introduced its newest small model: Claude Haiku 4.5. According to Anthropic, this model delivers better performance than Sonnet 4, while costing one-third as much and running at more than double the speed.

Claude Haiku 4.5 is engineered for high-volume, low-latency, cost-sensitive deployments. If your workload involves long-running agent loops, large numbers of LLM calls, or multiple agents running in parallel, this is a major shift.


Key technical highlights

Claude Haiku 4.5 is described as a “small, fast model” in Anthropic’s classification. It sits below the “frontier” models but delivers near-frontier coding and reasoning performance at a much lower cost.

On SWE-bench Verified (a real-world software engineering test using GitHub issues), Claude Haiku 4.5 scored ~73.3%. By comparison, Claude Sonnet 4.5 scored ~77.2%.

Claude Haiku 4.5 supports both text and image inputs and is capable of extended thinking, computer use, and tool-assisted workflows.
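
As a rough illustration of how such a request might look with Anthropic's Python SDK, here is a minimal sketch of a multimodal call with extended thinking enabled. The model identifier "claude-haiku-4-5", the screenshot filename, and the thinking parameters are assumptions based on Anthropic's current API conventions, not values confirmed by this article.

    # Minimal sketch: image + text input with extended thinking enabled,
    # using Anthropic's Python SDK. Model ID and thinking parameters are assumed.
    import base64

    import anthropic

    client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

    with open("screenshot.png", "rb") as f:  # hypothetical input image
        image_b64 = base64.standard_b64encode(f.read()).decode("utf-8")

    response = client.messages.create(
        model="claude-haiku-4-5",  # assumed model identifier
        max_tokens=2048,
        thinking={"type": "enabled", "budget_tokens": 1024},  # extended thinking budget (assumed)
        messages=[
            {
                "role": "user",
                "content": [
                    {
                        "type": "image",
                        "source": {
                            "type": "base64",
                            "media_type": "image/png",
                            "data": image_b64,
                        },
                    },
                    {"type": "text", "text": "Describe the UI in this screenshot and list any visible errors."},
                ],
            }
        ],
    )

    # The response interleaves thinking blocks and text blocks; print only the text.
    for block in response.content:
        if block.type == "text":
            print(block.text)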

The model is available via the Claude API at US$1 per million input tokens and US$5 per million output tokens, significantly cheaper than Anthropic's higher-tier models.
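
To make the pricing concrete, the small helper below estimates per-request cost from token counts at the $1 / $5 per-million-token rates quoted above. The Sonnet-tier comparison rates of $3 / $15 are an assumption implied by the "one-third the cost" claim, not figures from this article.

    # Back-of-the-envelope cost estimator at the Haiku 4.5 rates quoted above
    # ($1 per 1M input tokens, $5 per 1M output tokens). The Sonnet-tier rates
    # are an assumption implied by the "one-third the cost" comparison.
    HAIKU_45_RATES = {"input": 1.00, "output": 5.00}      # USD per 1M tokens
    SONNET_TIER_RATES = {"input": 3.00, "output": 15.00}  # assumed, for comparison

    def request_cost(input_tokens: int, output_tokens: int, rates: dict[str, float]) -> float:
        """Return the USD cost of one request at the given per-1M-token rates."""
        return (input_tokens * rates["input"] + output_tokens * rates["output"]) / 1_000_000

    # Example: a chatbot turn with a 2,000-token prompt and a 500-token reply.
    print(f"Haiku 4.5:   ${request_cost(2_000, 500, HAIKU_45_RATES):.4f}")     # $0.0045
    print(f"Sonnet tier: ${request_cost(2_000, 500, SONNET_TIER_RATES):.4f}")  # $0.0135

At those rates, a million such chatbot turns would cost roughly $4,500 on Claude Haiku 4.5 versus about $13,500 at the assumed Sonnet-tier pricing.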

In terms of safety and alignment, Anthropic classifies Haiku 4.5 under its AI Safety Level 2 (ASL-2) standard, a less restrictive classification than the ASL-3 applied to its larger models, and reports improved behavior on alignment benchmarks.

What this means for applications & users

For developers, product teams, and businesses, Claude Haiku 4.5 opens up new possibilities:

  • Cost-sensitive workflows: When you are running thousands or tens of thousands of model calls (e.g., customer service assistants, chatbots, embedded agents), the lower cost per token matters.
  • Speed/latency-critical use cases: Claude Haiku 4.5 is faster, so it is well-suited for real-time interaction, multi-agent orchestration, or workflows where response speed is key.
  • Scaling agents: If you architect a system with a top-tier model as the “brain” and multiple sub-agents handling sub-tasks, Claude Haiku 4.5 offers a faster, cheaper sub-agent tier without sacrificing too much capability (see the sketch after this list).
  • High capability retained: Claude Haiku 4.5 delivers performance close to what was considered cutting-edge only months ago, at more affordable pricing, for many real-world coding, tool-use, and reasoning tasks.
  • Flexibility in deployment: Claude Haiku 4.5 is available on Claude Code and Anthropic’s apps. Developers can access the model via API and on major cloud platforms (e.g., Amazon Bedrock, Google Cloud’s Vertex AI), making model adoption smoother.
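
For the “brain plus sub-agents” pattern mentioned above, the sketch below shows one way it could look: a Sonnet-tier planner breaks a task into sub-tasks and fans them out to Claude Haiku 4.5 workers in parallel via Anthropic's async Python client. The model identifiers, prompts, and task split are illustrative assumptions, not an official pattern from the announcement.

    # Sketch of a "frontier brain + Haiku sub-agents" orchestration, assuming
    # Anthropic's async Python SDK. Model IDs and prompts are illustrative only.
    import asyncio

    import anthropic

    client = anthropic.AsyncAnthropic()  # reads ANTHROPIC_API_KEY from the environment

    PLANNER_MODEL = "claude-sonnet-4-5"  # assumed ID for the orchestrating "brain"
    WORKER_MODEL = "claude-haiku-4-5"    # assumed ID for the fast, cheap sub-agents

    async def plan(task: str) -> list[str]:
        """Ask the planner model to split a task into independent sub-tasks, one per line."""
        response = await client.messages.create(
            model=PLANNER_MODEL,
            max_tokens=1024,
            messages=[{
                "role": "user",
                "content": f"Break this task into short, independent sub-tasks, one per line:\n{task}",
            }],
        )
        return [line.strip() for line in response.content[0].text.splitlines() if line.strip()]

    async def run_subtask(subtask: str) -> str:
        """Run a single sub-task on the cheaper, faster worker model."""
        response = await client.messages.create(
            model=WORKER_MODEL,
            max_tokens=1024,
            messages=[{"role": "user", "content": subtask}],
        )
        return response.content[0].text

    async def main() -> None:
        subtasks = await plan("Audit this repository's README, CI config, and dependency pins for issues.")
        # Fan the sub-tasks out to Haiku 4.5 workers concurrently.
        results = await asyncio.gather(*(run_subtask(s) for s in subtasks))
        for subtask, result in zip(subtasks, results):
            print(f"--- {subtask}\n{result}\n")

    asyncio.run(main())

The appeal of this split is that the expensive planner call happens once per task, while the many parallel worker calls run at Haiku-level pricing and latency.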

Conclusions

The era when only the most expensive models could deliver top performance is changing. With Claude Haiku 4.5, Anthropic offers a compelling value proposition: remarkable performance, fast speed, and significantly lower cost. For organizations looking to embed AI agents, deploy at scale, or experiment with generative AI workflows, this model opens doors that were previously constrained by budget or latency.
If you are working on AI-powered systems (chatbots, cloud agents, generative workflows), Claude Haiku 4.5 may well allow you to iterate faster, deploy more broadly, and keep your TCO (total cost of ownership) in check.

Source: https://www.anthropic.com/news/claude-haiku-4-5