Text-To-Text

MaaS-DB

Core Capabilities and Performance: Equipped with a 256K context window, it can process long texts exceeding 300,000 Chinese characters, with a maximum chain-of-thought length of 32K.
Technical Features and Cost-Effectiveness: Supports functions such as function calling, structured output, and batch inference. It adopts a tiered pricing model based on input length to reduce enterprises' calling costs.

MaaS-Qwen

Model Scale and Architecture Design: MaaS_Qwen3_coder_480b_a35b_instruct is a large model oriented towards the programming domain. It adopts an efficient architectural design with a total parameter scale of 480 billion, while activating 35 billion parameters during inference. This enables it to operate efficiently while ensuring strong capabilities, with a focus on in-depth optimization for coding scenarios.
Core Programming Capabilities: The model excels in code generation, understanding, debugging, and handling complex programming tasks. It supports multiple programming languages and can meet diverse needs ranging from simple script writing to large-scale project development. It demonstrates particularly strong adaptability in constructing long code logic and handling cross-file associations.
Context and Task Adaptability: It natively supports a 256K token context window, allowing efficient processing of massive code texts. Additionally, it possesses excellent tool-calling capabilities and the ability to decompose multi-step tasks, enabling it to independently plan programming processes and adapt to advanced scenarios such as agent-based programming and complex system development.

MaaS-kimi

The following models are available for on-demand purchase:

MaaS_kimi_k2_0711_preview

MaaS_kimi_k2_0711_preview

All-scenario text generation: Supports various genres including argumentative essays, commercial copy, academic abstracts, etc., and can output scenario-appropriate content according to the given theme and style (e.g., formal, lively).
Accurate instruction execution: With strong semantic understanding capabilities, it accurately captures core requirements in users' prompts (such as word count limits and key points to highlight), reducing deviations between generated content and expectations.
Text optimization & conversational creation: Enables polishing, expansion, and abridgment of existing texts; also supports multi-turn conversational creation, allowing iterative refinement of content through supplementary instructions.

MaaS-grok

Grok is a cutting-edge text generation model developed by xAI, integrating advanced architecture and large-scale training to deliver outstanding performance in the field of text generation. Its key features are as follows:

Diversified text generation: Supports a wide range of text generation tasks including article writing, dialogue generation, and Q&A system construction. Grok is capable of creating stories, generating professional reports, and building intelligent customer service dialogue processes, among other tasks.
Powerful understanding and execution: It accurately comprehends input text and deeply grasps the meaning of complex instructions. Based on prompts, it quickly generates text content that is logically coherent, semantically accurate, and meets requirements. For instance, when handling instructions like "Create a thrilling adventure plot set in a sci-fi background", it can efficiently produce text that meets the specified demands.
Unique style presentation: It possesses a distinct "personality", with responses featuring a sense of humor and "rebellious" spirit. In the fun mode, its language style is lively during interactions with users, allowing it to crack jokes and make witty remarks, creating a relaxed and pleasant conversation atmosphere; in the regular mode, it provides professional and objective answers.

The following models are available for on-demand purchase:

MaaS_Gr_4.1_fast_reasoning_20251118
MaaS_Gr_4.1_fast_non_reasoning_20251118
MaaS_Gr_4_fast_non_reasoning_20250919
MaaS_Gr_4_fast_reasoning_20250919
MaaS_Gr_4_20250709
MaaS_Gr_3_20250218
MaaS_Gr_3_mini_20250218
MaaS_Gr_code_fast_1_20250828

MaaS-DS

The MaaS-DS series models excel in various benchmark tests in text, code, mathematics, and more.

The following models are available for on-demand purchase:

MaaS_DS_V3.2_exp_20250929_thinking
MaaS_DS_V3.2_exp_20250929
MaaS-DS-R1
MaaS-DS-V3
MaaS_DS_V3.1_20250821
MaaS_DS_V3.1_20250821_thinking

MaaS-DS-R1

Specialized Expertise: With long-chain cognitive capabilities, it holds professional expertise in coding and mathematics, ideal for rapidly meeting technical requirements.
Scalability: The model's architecture is designed to be flexible and easily extendable, adapting to different scales of datasets and computing resources.

MaaS-DS-V3

Wide Range of Applications: Applicable in multiple fields such as general knowledge Q&A, text creation, and learning assistance.
Efficient Inference: Utilizing a mixture of experts architecture, it significantly increases output speed, ensuring a swift and smooth user experience.
Multimodal Support: Supports multimodal interactions in text, graphics, audio, and more.
Scalability: Easily extendable, capable of adapting to datasets and computing resources of various scales.

MaaS-C

MaaS-C is a robust natural language processing model, distinguished by its powerful capabilities in language comprehension and generation. It adeptly understands complex semantic relationships and contextual information, producing high-quality, fluent, and natural text.

The following models are now available for purchase:

MaaS_Cl_Opus_4.5_20251124
MaaS_Cl_Haiku_4.5_20251016
MaaS_Cl_sonnet_4.5_20250929
MaaS_Cl_Opus_4.1_20250805
MaaS_Cl_Opus_4_20250514
MaaS_Cl_Sonnet_4_20250514
MaaS_Cl_Sonnet_3.7_20250219
MaaS_Cl_Sonnet_3.5_20241022
MaaS_Cl_Haiku_3.5_20240620

MaaS_Cl_Haiku_4.5

MaaS_Cl_Haiku_4.5 is a lightweight flagship AI model focused on high efficiency and cost-effectiveness. It takes "cutting-edge-approaching intelligent performance and extreme response speed" as its core advantages, achieving a breakthrough balance among performance, efficiency, and cost. Its key features are as follows:

Dual Breakthroughs in High Performance and Ultra-Fast Response: It achieves a 73% accuracy rate in the SWE-bench Verified programming benchmark test, with performance on par with mainstream mid-tier models. It can efficiently complete tasks such as code generation, debugging, and refactoring. Meanwhile, its response speed is more than twice that of its predecessor; the optimized token output efficiency enables it to quickly adapt to real-time interaction scenarios, significantly reducing waiting latency in tasks like code prototype development and instant consultation.
First Support for Extended Thinking and Multi-Step Reasoning: As the first model in the Haiku series with extended thinking capabilities, it can activate internal reasoning processes by enabling dedicated parameters, realizing multi-step decomposition and analysis of complex problems. It supports interleaved thinking between thought summary output and tool calling, and can independently plan multi-tool collaboration processes. Its performance in scenarios such as automated desktop interaction and browser operations even outperforms previous-generation mid-tier models.
Native Context Awareness and Memory Optimization: It has the ability to track token budgets in real time, dynamically sense the remaining context capacity during conversations, and improve task persistence through intelligent management of long-session information. Combined with beta-stage memory tools, it can automatically clean up redundant historical data, effectively handle long-running agent sessions, and reduce the risk of context window overflow.

MaaS_Cl_sonnet_4.5

MaaS_Cl_sonnet_4.5 is a flagship AI model, centered on exceptional programming capabilities. It also excels in agent construction, computer operations, and academic reasoning, making it suitable for complex development, long-term task processing, and professional-level problem-solving scenarios. Its core features are as follows:

Top-tier programming skills: Ranked first in the SWE-bench programming benchmark with an 82.0% accuracy rate, capable of handling full development cycle tasks including code generation, multi-file architecture design, and debugging large codebases. Demonstrates outstanding engineering adaptability and adherence to standards in generated code, with the ability to deeply understand project logic and maintain development continuity.
Support for Efficient Intelligent Agent Construction: Equipped with dedicated tools such as context editing and long-term memory management, it can support the development of complex intelligent agents. For example, it can stably manage task context in a 75-minute game of Settlers of Catan, maintain goal consistency during long-term interactions, and significantly reduce the cost of agent development and operation.
Excellent computer operation skills: Scored 61.4% in the OSWorld benchmark test (an increase of nearly 50% compared to the previous generation), capable of directly performing practical tasks such as browser operations, spreadsheet data organization, and batch document processing. For example, in the context of home renovation, it can independently collect, compile, and analyze renovation budget-related information.
Top-notch reasoning and mathematical ability: Achieved a perfect score in Python mode at the AIME 2025 high school mathematics competition, scored 83.4% in graduate-level reasoning tests, capable of efficiently handling complex formula derivations, logical proofs, and academic problem analysis, suitable for professional scenarios such as research assistance and mathematical modeling.

MaaS_Cl_Opus_4.1

MaaS_Cl_Opus_4.1_20250805 is a flagship AI model focused on professional fields, with core advantages in complex task processing capabilities and high reliability. Its key features are as follows:

Top-tier Coding Engineering Capabilities: It demonstrates outstanding performance in the authoritative SWE-bench Verified test, achieving a 74.5% bug fix success rate. It excels in multi-file code refactoring and debugging of million-line-level codebases, accurately identifying issues and reducing the rate of secondary bug introduction. The engineered code it generates can be directly adapted to development scenarios such as cloud-native environments.
Efficient Long-context and Multimodal Processing: It supports large-scale context windows and accurately captures key information from long texts through a dynamic attention mechanism. Meanwhile, it can integrate and analyze multiple types of content, including tables, code snippets, and images, achieving a 91% detail accuracy rate in ultra-long document parsing.
Autonomous Agent Task Execution: It can decompose complex tasks into executable sub-steps, independently plan processes, and connect to external tools and APIs. When facing anomalies such as data acquisition failures, it can flexibly adjust strategies, increasing the completion rate of complex tasks by 15% and keeping the interruption rate below 5%.

MaaS_Cl_Opus_4

MaaS_Cl_Opus_4_20250514 is a flagship AI model focused on in-depth processing of complex tasks. It takes sustained and stable high performance as well as strong reasoning capabilities as its core advantages, and is particularly suitable for professional development and long-duration collaboration scenarios. Its key features are as follows:

Top-Tier Coding and Engineering Capabilities: It delivers leading performance in authoritative benchmark tests such as SWE-bench and Terminal-bench. It can efficiently complete full-process tasks including multi-file code refactoring, large-scale codebase debugging, and Test-Driven Development (TDD). The generated code features outstanding accuracy and engineering adaptability, and it can deeply understand project specifications while maintaining development consistency.
Hybrid Reasoning and Long-Duration Task Capabilities: It adopts a dual-mode architecture of "standard thinking + extended thinking". In complex scenarios, it can trigger multi-step in-depth reasoning, and balances readability and integrity with the support of a thought summarization mechanism. It supports continuous operation for nearly 7 hours without performance degradation, and maintains the logical consistency of cross-time tasks through implicit knowledge storage, significantly reducing the cost of long-process collaboration.
Ultra-Large Context Window and Memory Optimization: It supports a context window of up to 200,000 tokens, enabling full loading of large-scale project code and documents, as well as dynamic extraction of key information to build an internal knowledge base. Combined with tools such as Files API, it can independently manage long-session information, and maintain compliance with core rules in multi-turn interactions without repeated reminders.
effectively handle long-running agent sessions, and reduce the risk of context window overflow.

MaaS_Cl_sonnet_4

1. Outstanding Performance and Reasoning Capability

Top-tier Coding Strength: MaaS 4 Opus leads in coding ability with a 72.5% score in the SWE-bench test, while MaaS-4 Sonnet also delivers excellent performance.
Advanced Logical Reasoning: Redefines industry standards in complex problem decomposition and in-depth logical thinking.

2. Upgraded Intelligent Agent and Interaction

Autonomous AI Agent System: Executes multi-level complex instructions more efficiently and autonomously, with significantly enhanced task processing workflow capabilities.
Intelligent Tool Collaboration: Supports dynamic invocation of external tools such as search, enabling parallel execution of "extended thinking-tool invocation" to optimize the response chain.
Enhanced Memory Function: After authorizing access to local files, it can generate "memory files" to persistently store dialogue context

3. Reliability and Experience Optimization

Precise Instruction Comprehension: Greatly improves the semantic analysis ability for user needs, significantly reducing instruction execution deviation rates.
Reinforced Behavioral Stability: Speculative operations in agent tasks are reduced by approximately 65% compared to the previous generation with more controllable output.
Innovative Reasoning Transparency: Generates "thinking summaries" through lightweight auxiliary models to visualize complex reasoning processes.
Flexible Dual-Mode Switching: Supports both "near-instant response" and "deep reasoning extended mode" to adapt to different scenario requirements.
effectively handle long-running agent sessions, and reduce the risk of context window overflow.

MaaS_Cl_sonnet_3.7

MaaS 3.7 Sonnet is a state - of - the - art text - generation model. Built upon advanced deep - learning architectures, it is dedicated to the natural language processing domain. This model has the capacity to comprehensively analyze the semantics, context, and emotional undertones of the input text, thereby generating high - quality, logically consistent, and expressive text for users. It finds extensive applications across a wide range of text - creation scenarios.

Robust Semantic Comprehension

Equipped with sophisticated neural network structures, MaaS 3.7 Sonnet meticulously analyzes input texts. It can accurately capture the meaning of each word and sentence, and even detect subtle semantic relationships. Whether dealing with simple daily expressions or complex professional texts, it can achieve in - depth understanding, laying a solid foundation for generating content that precisely meets user requirements.

Exceptional Text Generation Quality

The texts generated by MaaS 3.7 Sonnet are well - structured, with clear logic and natural - flowing sentences. The vocabulary usage is rich and appropriate. Whether for story - writing, copy - writing, or academic discourses, the output content reaches a professional standard, ensuring high readability.

Flexible Application Adaptability

This model can be adapted to various application scenarios. For instance, in content - creation platforms, it helps creators quickly conceive and produce first drafts. In intelligent customer service systems, it provides accurate and user - friendly responses. As a language - learning tool, it can generate practice materials. It fully meets the diverse needs of users.

High - efficiency Computational Performance

MaaS 3.7 Sonnet has been optimized for fast computation. It can respond to user requests rapidly and generate texts in a short time. Even when handling large - scale text - processing tasks, it can complete them efficiently, significantly enhancing the user experience and work efficiency.

effectively handle long-running agent sessions, and reduce the risk of context window overflow.

MaaS_Cl_sonnet_3.5

MaaS-3.5 Sonnet is the inaugural version of the MaaS 3.5 series, boasting enhanced speed and superior capabilities in coding, visual interpretation, and natural language understanding.

Multifarious Capabilities Surpassing Predecessors

In various performance tests across reading, programming, mathematics, and visual processing, it demonstrates exceptional proficiency. It shows marked improvement in understanding subtle nuances, humor, and complex instructions, with the ability to create high-quality content in a natural and appropriate tone.
Potent Visual Capabilities

Excelling in tasks involving the interpretation and analysis of visual data, it comprehends complex charts, graphs, and diagrams, analyzes infographics and scientific visualizations, and explains spatial relationships and contexts within scenes. It can seamlessly integrate image and textual information, accurately recognize and describe objects within images, perform visual question answering, and leverage visual data to aid in problem-solving, such as analyzing architectural plans or engineering diagrams. Additionally, it offers insights in art and design analysis, exhibits improved handwriting text recognition from imperfect images, processes various text styles and languages, comprehends the context of text within images, and often retains or describes the original formatting when transcribing structured text.
Wide Range of Applications

It can be employed in customer service, content creation, educational tutoring, programming assistance, data analysis, and other scenarios, potentially giving rise to entirely new business models and services.

MaaS_Cl_Haiku_3.5

Faster Response Speed

MaaS 3.5 Haiku inherits the fast response capability of MaaS 3 Haiku and further enhances it, enabling near real-time response generation. This provides users with a smoother interaction experience, making it particularly suitable for scenarios with high real-time requirements, such as online customer service and real-time decision-making.

More Powerful Functions

It possesses enhanced tool usage and reasoning capabilities, allowing for more efficient handling of complex tasks, such as multi-step workflows and tasks requiring integration of external tools. It can also move the cursor, click buttons, and even input text using a virtual keyboard by observing screenshots, operating the computer like a human, which significantly expands its application range in practical work.

Overall Performance Improvement

Improvements have been made across various skill domains, with outstanding performance in tasks such as coding, data extraction and labeling, and real-time content review. For example, in code generation, it can provide fast and accurate code suggestions and completions, reducing code-related errors.

Multilingual and Visual Capabilities

It also possesses multilingual and visual processing capabilities, enabling it to understand and respond to inputs in multiple languages and analyze and interpret visual information. This makes it suitable for handling multilingual and multi-type data in enterprise scenarios.

Enhanced Security

Extensive security assessments were conducted during development, covering various languages and policy areas, which enhanced the model's ability to handle sensitive content. This ensures that the model can provide powerful functionality while strictly adhering to security standards and producing reliable and appropriate content.

MaaS-Ge

The MaaS-Ge model is a high-performance, multitasking AI model, renowned for its exceptional precision and efficiency. It adeptly handles a diverse range of tasks, showcasing remarkable adaptability and flexibility. Moreover, the MaaS-Ge model is designed with scalability in mind, allowing it to be effortlessly deployed and optimized across various application scenarios to meet diverse business demands.

The following models are now available for purchase:

MaaS_Ge_3_pro_image_preview_20251120
MaaS_Ge_3_pro_preview_20251118
MaaS_Ge_2.5_flash_lite
MaaS_Ge_2.5_flash
MaaS_Ge_2.5_pro
MaaS_Ge_2.0_flash
MaaS_Ge_1.5_pro

MaaS_Ge_2.5_pro Preview

MaaS_Ge_2.5_pro Preview is a recommended model preview version that showcases superior performance across multiple aspects. The key features include:

Strong reasoning and classification capabilities

It can not only classify categories and perform reasoning, but also aggregate analytical information for generating detailed solutions in mathematics, science, and other fields.
Maximum upload capacity

Supports a maximum of 1 million tokens for text, capable of processing 200 million tokens at the input interface, allowing for the processing of large datasets and complex information, such as complete code libraries.
Natural multimedia integration

Capable of naturally understanding mixed texts, images, videos, and information, outputting various formats such as tables and summaries.
Output adaptability

In the output environment, it can adjust its performance based on specific user requirements, thereby enhancing functional utilization.

MaaS_Ge_2.0_flash

MaaS_Ge_2.0_flash is a powerful and efficient next-generation multimodal artificial intelligence model that seamlessly integrates vision, speech, and text processing capabilities into one.

Equipped with accurate and efficient visual recognition and image understanding technology, it can extract rich information from images.
Possessing outstanding speech recognition and synthesis capabilities, it can engage in seamless human-machine dialogue interactions.
Based on advanced natural language processing technologies, it excels at semantic analysis and text generation.
The three modal capabilities are seamlessly integrated, enabling cross-modal intelligence and ushering in a new era of artificial intelligence.
With robust core computing power and exceptional computational efficiency, it meets the demands of high-intensity multi-task processing.

MaaS_Ge_1.5_pro

Exceptional Contextual Processing Capacity

Capable of handling information containing up to 1 million tokens, it can comprehend extensive documents of up to 1500 pages in one go, summarize 100 emails, process an hour-long video, or manage a codebase exceeding 30,000 lines.
Multimodal Input Support

Proficient in simultaneously processing and understanding text, image, video, and audio data, it excels in handling complex scenarios rich in information, such as video content comprehension and multi-language translation tasks.
Outstanding Performance in Complex Tasks

Displays significant advancements in handling complex prompts and coding tasks, better addressing challenging task scenarios.
Efficient Reasoning Capabilities

Through innovative architecture and training methodologies, it accurately recalls and infers detailed information from extensive contextual data.
Promoting Multi-Field Application Development

Paves the way for breakthroughs in long-document Q&A, long-video Q&A, and long-context automatic speech recognition, while also providing robust support for practical applications in education, research, media, and numerous other fields.

MaaS-GP Series

MaaS_GP_5.1

Version	Description	Support Status
MaaS_GP_5.1	A next-generation general-purpose AI foundation model equipped with an adaptive reasoning engine that dynamically adjusts thinking depth based on task complexity. It supports text, image, and audio multimodal processing, delivering smarter and more empathetic outputs while maintaining fast response speeds.	Supported
MaaS_GP_5.1_chat	An optimized version for interactive conversation scenarios, featuring more natural dialogue flow and emotional understanding capabilities. It introduces a chain-of-thought mechanism for the first time and offers 8 preset conversation styles (e.g., friendly, professional, straightforward), making AI interactions more human-like and better at following user instructions accurately.	Supported
MaaS_GP_5.1_codex	A programming model tailored for software development, deeply optimized for code generation, debugging, and large-scale project development. It supports multi-file consistency maintenance and can work continuously for extended periods (over 24 hours in internal tests), making it ideal for professional programming scenarios such as full application construction, complex refactoring, and security audits.	Supported
MaaS_GP_5.1_codex_mini	A lightweight programming assistant model that retains core coding capabilities while significantly improving response speed and reducing resource consumption. It is particularly suitable for fast code snippet generation, small-scale projects, and high-frequency simple programming tasks, providing a cost-effective development support experience.	Supported

MaaS_GP_5

Version	Description	Support Status
MaaS_GP_5	A high - end large - scale language model with strong comprehensive capabilities, suitable for complex tasks such as in - depth text generation, multi - scenario reasoning, and advanced language interaction, aiming to provide high - quality and diverse language services.	Supported
MaaS_GP_5_mini	A lightweight variant of GPT - 5, focusing on meeting relatively simple language processing needs with lower resource consumption, suitable for scenarios like basic text generation and daily dialogue interaction.	Supported
MaaS_GP_5_nano	An ultra - lightweight version, designed for extremely resource - constrained environments or simple quick language processing tasks, emphasizing high efficiency and conciseness in a narrow range of applications.	Supported
MaaS_GP_5_codex	Flagship model dedicated to programming, capable of multilingual code generation, debugging, and refactoring, in-depth analysis of codebases at the scale of hundreds of millions of lines, and integration with cross-platform toolchains, suitable for the entire enterprise software development process.	Supported
MaaS_GP_5_pro	An all-in-one flagship model that supports deep reasoning across multiple domains, multi-modal interaction, and complex task decomposition and execution. It accommodates extremely long contexts and tool collaboration, making it suitable for professional scenarios such as scientific research and business decision-making.	Pending support
MaaS_GP_5_chat	A lightweight flagship model focused on conversational interaction, centered on natural and fluent multi-turn communication, contextualized responses, and real-time information integration capabilities, suitable for high-frequency interaction scenarios such as daily communication and light office collaboration.	Supported

MaaS-4.1

The MaaS-4.1 series includes MaaS_4.1, MaaS_4.1 mini, and MaaS_4.1 nano three model types, supporting up to 1 million tokens for text generation, with strong coding capabilities, high model robustness, and multi-task processing advantages.

Version	Description	Support Status
MaaS_4.1	Smartest model for complex tasks	Supported
MaaS_4.1_mini	Affordable model balancing speed and intelligence	Supported
MaaS_4.1_nano	Fastest, most cost-effective model for low-latency tasks	Supported

MaaS-o*

MaaS o* model is specifically designed to handle reasoning and problem-solving tasks, with improved specificity and functionality. These models spend more time processing and understanding user requests, and they exhibit exceptional strength in fields such as science, coding, and mathematics compared to their earlier iterations.

Version	Description	Maximum Request/Tokens	Support Status
MaaS o4 mini	This inference model is faster and more cost - effective, demonstrating excellent performance in mathematics, coding, and vision.	Input: 128,000 Output: 16,000	Supported
MaaS o3 pro_20250610	Response API；Structured Output；Text/Image Processing；Functions/Tools.	Input: 200,000 Output: 100,000	Supported
MaaS o3 mini 2025-01-31	The latest reasoning model, offering enhanced reasoning abilities.	Input: 200,000 Output: 100,000	Supported
MaaS o1-1217-Global	The most capable model in the o1 series, offering enhanced reasoning abilities.	Input: 200,000 Output: 100,000	Supported
MaaS o1 preview 2024-09-12	Older preview version.	Input: 128,000 Output: 32,768	Supported
MaaS o1 mini 2024-09-12	A faster and more cost-efficient option in the o1 series, ideal for coding tasks requiring speed and lower resource consumption.	Input: 128,000 Output: 65,536	Supported

MaaS-4o

MaaS-4o integrates text and images within a single model, allowing it to simultaneously process multiple data types. This multimodal approach enhances the accuracy and responsiveness of human-computer interactions. Comparable to MaaS-4 Turbo in English text and coding tasks, MaaS-4o surpasses it in performance for non-English languages and visual tasks, setting new benchmarks for AI capabilities.

Version	Description	Maximum Request/Tokens	Support Status
MaaS-4o（2024-11-20）	Latest large GA model.	Input:128,000 Output:16,384	Supported
MaaS-4o（2024-08-06）Pro	Compared to Maas-4o(2024-08-06)，MaaS-4o（2024-08-06）Pro shows a significant improvement in request experimentation.	Input:128,000 Output:16,384	Supported
Maas-4.0 (2024-08-06)	Maas-4.0 (2024-08-06) contains all the features of the previous version, as well as: 1. Enhanced functionality for structured output extraction. 2. The maximum output token count has increased from 4,096 to 16,384.	Input: 128,000 Output: 16,384	Supported
MaaS-4o mini (2024-07-18)	The latest compact GA model 1. A fast, affordable, and powerful model, an ideal replacement for the MaaS 3.5 Turbo series. 2. Text and image processing. 3. JSON mode. 4. Parallel function calling. 5. Enhanced features not supported.	Input: 128,000 Output: 16,384	Supported
MaaS-4o (2024-05-13)	The latest large GA model 1. Text and image processing. 2. JSON mode. 3. Parallel function calling. 4. Enhanced accuracy and responsiveness. 5. Comparable to the visual-enabled MaaS-4 Turbo for English text and coding tasks. 6. Superior performance in non-English languages and visual tasks. 7. Enhanced features not supported.	Input: 128,000 Output: 4,096	Supported

MaaS-4 Turbo

MaaS-4 Turbo is a large multimodal model (accepting both text and image inputs to generate text), optimized for chat functionality similarly to MaaS-3.5 Turbo and the earlier MaaS-4 models. It excels in handling conventional completion tasks proficiently.

Version	Description	Maximum Request/Tokens	Support Status
MaaS-4 Turbo (2024-04-09)	The latest GA model 1. A replacement for all MaaS-4 preview models (vision-preview, 1106-Preview, 0125-Preview). 2. Feature availability currently varies based on input method and deployment type. 3. Enhanced features not supported.	Input: 128,000 Output: 4,096	Supported

It is a replacement for the following preview models:

MaaS-4 version：1106-Preview
MaaS-4 version：0125-Preview
MaaS-4 version：vision-preview

MaaS-4

MaaS-4 is the predecessor of MaaS-4 Turbo. Both the MaaS-4 model and the MaaS-4 Turbo model share the foundational model name MaaS-4. The distinction between the MaaS-4 model and the Turbo model can be made by examining the model version.

Version	Description	Maximum Request/Tokens	Support Status
MaaS-4 (0125-Preview) Preview Version of MaaS-4 Turbo	Preview Model 1. Replaced 1106-Preview 2. Enhanced code generation performance 3. Reduced instances of incomplete tasks 4. JSON mode 5. Parallel function calls 6. Reproducible outputs (preview)	Input: 128,000 Output: 4,096	Supported
MaaS-4 (vision-preview) MaaS-4 Turbo with Vision Capabilities Preview	Preview Model 1. Accepts text and image inputs 2. Supports enhanced features 3. JSON mode 4. Parallel function calls 5. Reproducible outputs (preview)	Input: 128,000 Output: 4,096	Supported
MaaS-4 (1106-Preview) MaaS-4 Turbo Preview Version	Preview Model 1. JSON mode 2. Parallel function calls 3. Reproducible outputs (preview)	Input: 128,000 Output: 4,096	Supported
MaaS-4-32k (0613)	Older GA Model 1. Basic function call using tools	32,768	Upon Request
MaaS-4 (0613)	Older GA Model 1. Basic function call using tools	8,192	Upon Request
MaaS-4-32k (0314)	Older GA Model	32,768	Upon Request
MaaS-4 (0314)	Older GA Model	8,192	Upon Request

Compared to MaaS-4-1106-preview, MaaS-4 version 0125-preview more thoroughly accomplishes tasks such as code generation. Hence, depending on the task, clients might find that MaaS-4-0125-preview produces more output than MaaS-4-1106-preview. We recommend that clients compare the outputs of the new model. MaaS-4-0125-preview also addresses a bug in the UTF-8 handling for non-English languages that was present in MaaS-4-1106-preview. MaaS-4 version turbo-2024-04-09 is the latest GA version, superseding 0125-Preview, 1106-preview, and vision-preview.

MaaS-3.5

The MaaS-3.5 model is capable of understanding and generating natural language or code. The most powerful and cost-effective model in the MaaS-3.5 series is MaaS-3.5 Turbo, which has been optimized for chat and excels at traditional completion tasks. MaaS-3.5 Turbo is available for use with the chat completion API. The MaaS-3.5 Turbo instructions provide functionality similar to text-davinci-003 when using the completion API rather than the chat completion API.

Version	Description	Maximum Request/Tokens	Support Status
MaaS-3.5-turbo-0125	Latest GA Model 1. JSON Mode 2. Parallel Function Calls 3. Reproducible Outputs (Preview) 4. Higher Accuracy in Responding in Requested Format 5. Bug Fixes for Text Encoding Issues in Non-English Function Calls	Input: 16,385 Output: 4,096	Supported
MaaS-35-turbo (1106)	Previous GA Model 1. JSON Mode 2. Parallel Function Calls 3. Reproducible Outputs (Preview)	Input: 16,385 Output: 4,096	Available on Request
MaaS-35-turbo-instruct (0914)	Completion Endpoint Only	4,097	Supported
MaaS-35-turbo-16k (0613)	Previous GA Model 1. Basic Function Calls Using Tools	Input: 16,384	Available on Request
MaaS-35-turbo (0613)	Previous GA Model 1. Basic Function Calls Using Tools	Input: 4,096	Available on Request
MaaS-5-turbo (0301)	Previous GA Model	Input: 4,096	Available on Request