Skip to content

Text-To-Text

MaaS-Llama

The MaaS-Llama series of models have high performance and a wide range of application prospects.

The following models are available for purchase:

  • MaaS Llama 3.1 70B Instruct
  • MaaS Llama 3.1 8B Instruct

MaaS Llama 3.1 70B Instruct

MaaS Llama 3.1 70B Instruct is a high-level language model with 70 billion parameters, specifically designed and optimized for instruction-following tasks. This model is based on the latest deep learning techniques and can understand and execute complex instructions, suitable for various natural language processing tasks such as text generation and Q&A.

  • Parameter Count: With 70 billion parameters, it can abstract and generalize complex language patterns and contexts.

  • Instruction Optimization: Specifically optimized for instruction-following tasks, it can accurately execute user instructions.

  • Multi-task Adaptability: Suitable for various natural language processing tasks such as text generation, Q&A, and dialogue.

  • High Accuracy and Reliability: Achieves high accuracy and reliability through large-scale pre-training, capable of providing intelligent language services.

MaaS Llama 3.1 8B Instruct

MaaS Llama 3.1 8B Instruct is a high-level language model with 8 billion parameters, specifically designed and optimized for instruction-following tasks. This model is based on the latest deep learning techniques and can understand and execute complex instructions, suitable for various natural language processing tasks such as text generation and Q&A.

  • Parameter Count: With 8 billion parameters, it guarantees performance while optimizing resource use.

  • Instruction Optimization: Specifically optimized for instruction-following tasks, it can accurately understand and execute instructions.

  • Multi-task Adaptability: Suitable for various natural language processing tasks such as text generation and Q&A.

  • High Efficiency and Reliability: Trained on large-scale data, it possesses high efficiency and reliability, capable of providing intelligent language services.

MaaS-C

MaaS-C is a robust natural language processing model, distinguished by its powerful capabilities in language comprehension and generation. It adeptly understands complex semantic relationships and contextual information, producing high-quality, fluent, and natural text.

The following models are now available for purchase:

  • MaaS-3.5 Sonnet
  • MaaS-3 Sonnet
  • MaaS-3.5 Haiku
  • MaaS-3 Haiku (On demand)

MaaS-3.5 Sonnet

MaaS-3.5 Sonnet is the inaugural version of the MaaS 3.5 series, boasting enhanced speed and superior capabilities in coding, visual interpretation, and natural language understanding.

  • Multifarious Capabilities Surpassing Predecessors

    In various performance tests across reading, programming, mathematics, and visual processing, it demonstrates exceptional proficiency. It shows marked improvement in understanding subtle nuances, humor, and complex instructions, with the ability to create high-quality content in a natural and appropriate tone.

  • Potent Visual Capabilities

    Excelling in tasks involving the interpretation and analysis of visual data, it comprehends complex charts, graphs, and diagrams, analyzes infographics and scientific visualizations, and explains spatial relationships and contexts within scenes. It can seamlessly integrate image and textual information, accurately recognize and describe objects within images, perform visual question answering, and leverage visual data to aid in problem-solving, such as analyzing architectural plans or engineering diagrams. Additionally, it offers insights in art and design analysis, exhibits improved handwriting text recognition from imperfect images, processes various text styles and languages, comprehends the context of text within images, and often retains or describes the original formatting when transcribing structured text.

  • Wide Range of Applications

    It can be employed in customer service, content creation, educational tutoring, programming assistance, data analysis, and other scenarios, potentially giving rise to entirely new business models and services.

MaaS-3 Sonnet

  • Multilingual and Multitasking Capabilities

    Exceling in analysis, prediction, meticulous content creation, code generation, and multilingual conversations, it is capable of summarizing approximately 150,000 words, handling multiple languages, and executing various tasks such as comprehending complex texts and coding.

  • Visual Processing Abilities

    It can manage photos, charts, graphs, and technical diagrams, as well as formatted materials like PDFs, flowcharts, or slides. It accurately identifies and describes objects within images, performs visual question answering, and utilizes visual information to assist in problem-solving, such as analyzing architectural plans or engineering diagrams, and providing insights in art and design analysis.

  • Long-Text Processing Capabilities

    The model can accept texts up to 200,000 words and has the capacity to remember over 1 million tokens.

  • Faster Execution Speed

    With enhanced execution speed, it can handle tasks and respond to user requests more efficiently.

Here’s the translation of the provided text into English:

MaaS 3.5 Haiku

  • Faster Response Speed

MaaS 3.5 Haiku inherits the fast response capability of MaaS 3 Haiku and further enhances it, enabling near real-time response generation. This provides users with a smoother interaction experience, making it particularly suitable for scenarios with high real-time requirements, such as online customer service and real-time decision-making.

  • More Powerful Functions

It possesses enhanced tool usage and reasoning capabilities, allowing for more efficient handling of complex tasks, such as multi-step workflows and tasks requiring integration of external tools. It can also move the cursor, click buttons, and even input text using a virtual keyboard by observing screenshots, operating the computer like a human, which significantly expands its application range in practical work.

  • Overall Performance Improvement

Improvements have been made across various skill domains, with outstanding performance in tasks such as coding, data extraction and labeling, and real-time content review. For example, in code generation, it can provide fast and accurate code suggestions and completions, reducing code-related errors.

  • Multilingual and Visual Capabilities

It also possesses multilingual and visual processing capabilities, enabling it to understand and respond to inputs in multiple languages and analyze and interpret visual information. This makes it suitable for handling multilingual and multi-type data in enterprise scenarios.

  • Enhanced Security

Extensive security assessments were conducted during development, covering various languages and policy areas, which enhanced the model's ability to handle sensitive content. This ensures that the model can provide powerful functionality while strictly adhering to security standards and producing reliable and appropriate content.

MaaS-3 Haiku

  • Rapid Speed

    Capable of reading a paper with dense charts and graphical information (approximately 10,000 tokens) in under three seconds.

  • High Cost-Performance Ratio

    Offers significant cost-effectiveness in the intelligent category market.

  • Enhanced Multilingual and Multitasking Abilities

    Improved capabilities in analysis, prediction, content creation, code generation, and multilingual dialogues.

  • Visual Processing Abilities

    Adept at handling various visual formats, including photos, charts, graphs, and technical illustrations. It can accurately identify and describe objects within images, perform visual question answering, and leverage visual information to assist in problem-solving.

  • Long-Text Processing Abilities

    Supports a context window of 200,000 tokens upon launch and can accept over 1 million tokens as input, with a strong memory capacity to accurately recall information from vast amounts of data.

MaaS-Ge

The MaaS-Ge model is a high-performance, multitasking AI model, renowned for its exceptional precision and efficiency. It adeptly handles a diverse range of tasks, showcasing remarkable adaptability and flexibility. Moreover, the MaaS-Ge model is designed with scalability in mind, allowing it to be effortlessly deployed and optimized across various application scenarios to meet diverse business demands.

The following models are now available for purchase:

  • MaaS-1.0 Pro (On demand)
  • MaaS-1.5 Pro
  • MaaS-1.5 Flash (On demand)

MaaS-1.0 Pro

The MaaS-1.0 Pro exhibits high flexibility and adaptability, excelling in specific fields and application scenarios.

  • Multimodal Capabilities

    Capable of processing inputs from text, images, audio, and video, and generating text and image outputs. For instance, in educational contexts, it can comprehend physics problems, accurately recognize messy handwritten content, understand the structure of questions, convert problems and solutions into mathematical typesetting, identify erroneous steps in students' problem-solving processes, and then provide correct solutions.

  • Long Context Support

    Supports context lengths of up to 32k, enabling better handling of lengthy texts and complex task scenarios.

  • Efficient Attention Mechanisms

    Employs efficient attention mechanisms, such as multi-query attention, to enhance the model's processing efficiency and accuracy.

MaaS-1.5 Pro

  • Exceptional Contextual Processing Capacity

    Capable of handling information containing up to 1 million tokens, it can comprehend extensive documents of up to 1500 pages in one go, summarize 100 emails, process an hour-long video, or manage a codebase exceeding 30,000 lines.

  • Multimodal Input Support

    Proficient in simultaneously processing and understanding text, image, video, and audio data, it excels in handling complex scenarios rich in information, such as video content comprehension and multi-language translation tasks.

  • Outstanding Performance in Complex Tasks

    Displays significant advancements in handling complex prompts and coding tasks, better addressing challenging task scenarios.

  • Efficient Reasoning Capabilities

    Through innovative architecture and training methodologies, it accurately recalls and infers detailed information from extensive contextual data.

  • Promoting Multi-Field Application Development

    Paves the way for breakthroughs in long-document Q&A, long-video Q&A, and long-context automatic speech recognition, while also providing robust support for practical applications in education, research, media, and numerous other fields.

MaaS-1.5 Flash

MaaS-1.5 Flash is a lightweight and efficient AI model with multimodal processing capabilities, capable of handling text, images, audio, and video simultaneously.

  • High-Speed Response

    The optimized architecture enables rapid response when processing large volumes of data, generating output at the fastest speed across all test languages, ensuring users receive answers and results more quickly, thereby enhancing interaction and usage efficiency.

  • Extended Context Window

    Supports contextual processing of up to two million tokens, allowing it to tackle complex tasks such as long video analysis and multi-chapter document generation. For example, it can comprehend multiple extensive documents totaling up to 1500 pages in a single go, summarize 100 emails, process an hour-long video, or manage a codebase exceeding 30,000 lines.

  • Multimodal Processing

    Not only able to handle text, but it also delves deeply into analyzing and understanding images, audio, and video. For instance, it can identify the main content and visual details in videos, analyze and answer questions from image and video inputs, although it still has some shortcomings when processing audio inputs.

  • Versatile Task Application

    Excels in tasks such as summarization, chat applications, providing image captions and video subtitles, and extracting data from long documents and tables.

MaaS-GP系列

MaaS-o1

MaaS o1 model is specifically designed to handle reasoning and problem-solving tasks, with improved specificity and functionality. These models spend more time processing and understanding user requests, and they exhibit exceptional strength in fields such as science, coding, and mathematics compared to their earlier iterations.

Version Description Maximum Request/Tokens Support Status
MaaS o1 preview 2024-09-12 The o1 series features the most powerful model, offering enhanced reasoning capabilities. Input: 128,000
Output: 32,768
Supported
MaaS o1 mini 2024-09-12 The o1 series includes faster and more cost-effective options, making them ideal for coding tasks that require speed and lower resource consumption. Input: 128,000
Output: 65,536
Supported

MaaS-4o

MaaS-4o integrates text and images within a single model, allowing it to simultaneously process multiple data types. This multimodal approach enhances the accuracy and responsiveness of human-computer interactions. Comparable to MaaS-4 Turbo in English text and coding tasks, MaaS-4o surpasses it in performance for non-English languages and visual tasks, setting new benchmarks for AI capabilities.

Version Description Maximum Request/Tokens Support Status
MaaS-4o(2024-08-06)Pro Compared to Maas-4o(2024-08-06),MaaS-4o(2024-08-06)Pro shows a significant improvement in request experimentation. Input:128,000
Output:16,384
Supported
Maas-4.0 (2024-08-06) Maas-4.0 (2024-08-06) contains all the features of the previous version, as well as:
1. Enhanced functionality for structured output extraction.
2. The maximum output token count has increased from 4,096 to 16,384.
Input: 128,000
Output: 16,384
Supported
MaaS-4o mini (2024-07-18) The latest compact GA model
1. A fast, affordable, and powerful model, an ideal replacement for the MaaS 3.5 Turbo series.
2. Text and image processing.
3. JSON mode.
4. Parallel function calling.
5. Enhanced features not supported.
Input: 128,000
Output: 16,384
Supported
MaaS-4o (2024-05-13) The latest large GA model
1. Text and image processing.
2. JSON mode.
3. Parallel function calling.
4. Enhanced accuracy and responsiveness.
5. Comparable to the visual-enabled MaaS-4 Turbo for English text and coding tasks.
6. Superior performance in non-English languages and visual tasks.
7. Enhanced features not supported.
Input: 128,000
Output: 4,096
Supported

MaaS-4 Turbo

MaaS-4 Turbo is a large multimodal model (accepting both text and image inputs to generate text), optimized for chat functionality similarly to MaaS-3.5 Turbo and the earlier MaaS-4 models. It excels in handling conventional completion tasks proficiently.

Version Description Maximum Request/Tokens Support Status
MaaS-4 Turbo (2024-04-09) The latest GA model
1. A replacement for all MaaS-4 preview models (vision-preview, 1106-Preview, 0125-Preview).
2. Feature availability currently varies based on input method and deployment type.
3. Enhanced features not supported.
Input: 128,000
Output: 4,096
Supported

It is a replacement for the following preview models:

  • MaaS-4 version:1106-Preview
  • MaaS-4 version:0125-Preview
  • MaaS-4 version:vision-preview

MaaS-4

MaaS-4 is the predecessor of MaaS-4 Turbo. Both the MaaS-4 model and the MaaS-4 Turbo model share the foundational model name MaaS-4. The distinction between the MaaS-4 model and the Turbo model can be made by examining the model version.

Version Description Maximum Request/Tokens Support Status
MaaS-4 (0125-Preview)
Preview Version of MaaS-4 Turbo
Preview Model
1. Replaced 1106-Preview
2. Enhanced code generation performance
3. Reduced instances of incomplete tasks
4. JSON mode
5. Parallel function calls
6. Reproducible outputs (preview)
Input: 128,000
Output: 4,096
Supported
MaaS-4 (vision-preview)
MaaS-4 Turbo with Vision Capabilities Preview
Preview Model
1. Accepts text and image inputs
2. Supports enhanced features
3. JSON mode
4. Parallel function calls
5. Reproducible outputs (preview)
Input: 128,000
Output: 4,096
Supported
MaaS-4 (1106-Preview)
MaaS-4 Turbo Preview Version
Preview Model
1. JSON mode
2. Parallel function calls
3. Reproducible outputs (preview)
Input: 128,000
Output: 4,096
Supported
MaaS-4-32k (0613) Older GA Model
1. Basic function call using tools
32,768 Upon Request
MaaS-4 (0613) Older GA Model
1. Basic function call using tools
8,192 Upon Request
MaaS-4-32k (0314) Older GA Model 32,768 Upon Request
MaaS-4 (0314) Older GA Model 8,192 Upon Request
  • Compared to MaaS-4-1106-preview, MaaS-4 version 0125-preview more thoroughly accomplishes tasks such as code generation. Hence, depending on the task, clients might find that MaaS-4-0125-preview produces more output than MaaS-4-1106-preview. We recommend that clients compare the outputs of the new model. MaaS-4-0125-preview also addresses a bug in the UTF-8 handling for non-English languages that was present in MaaS-4-1106-preview. MaaS-4 version turbo-2024-04-09 is the latest GA version, superseding 0125-Preview, 1106-preview, and vision-preview.

MaaS-3.5

The MaaS-3.5 model is capable of understanding and generating natural language or code. The most powerful and cost-effective model in the MaaS-3.5 series is MaaS-3.5 Turbo, which has been optimized for chat and excels at traditional completion tasks. MaaS-3.5 Turbo is available for use with the chat completion API. The MaaS-3.5 Turbo instructions provide functionality similar to text-davinci-003 when using the completion API rather than the chat completion API.

Version Description Maximum Request/Tokens Support Status
MaaS-3.5-turbo-0125 Latest GA Model
1. JSON Mode
2. Parallel Function Calls
3. Reproducible Outputs (Preview)
4. Higher Accuracy in Responding in Requested Format
5. Bug Fixes for Text Encoding Issues in Non-English Function Calls
Input: 16,385
Output: 4,096
Supported
MaaS-35-turbo (1106) Previous GA Model
1. JSON Mode
2. Parallel Function Calls
3. Reproducible Outputs (Preview)
Input: 16,385
Output: 4,096
Available on Request
MaaS-35-turbo-instruct (0914) Completion Endpoint Only 4,097 Supported
MaaS-35-turbo-16k (0613) Previous GA Model
1. Basic Function Calls Using Tools
Input: 16,384 Available on Request
MaaS-35-turbo (0613) Previous GA Model
1. Basic Function Calls Using Tools
Input: 4,096 Available on Request
MaaS-5-turbo (0301) Previous GA Model Input: 4,096 Available on Request