Skip to content

Overview

This product manual provides a comprehensive overview of the MaaS API system. The system is designed to simplify and optimize API usage, offering cost-effective API services. Through seamless integration with multiple API providers, users can flexibly choose API interfaces from various vendors. The system charges based on the number of API requests, eliminating the need for complex configurations such as subscriptions and quotas.

The MaaS API system aims to provide enterprise users with a user-friendly interface for efficient access and utilization of APIs, thereby achieving more economical API usage. Users can flexibly select different API interfaces according to their actual needs, and the system will charge based on usage, simplifying the complexity of API management.

Background

As intelligent language models continue to evolve, the MaaS API model has gained widespread recognition among enterprise users. However, due to the reliance on advanced models provided by external companies, enterprise versions utilizing MaaS API may face certain challenges. To address this issue, the MaaS API system aims to provide a user-friendly interface, facilitating efficient and economical use of intelligent language models for businesses. The system simplifies complex configuration management by charging based on the number of API requests, eliminating the need for users to consider issues such as subscriptions and quotas, thereby allowing them to focus on business applications.

Models

The MaaS API system supports a wide range of models for different tasks, including:

  • Text-to-Text models: These models are designed to generate text based on a given input. They can be used for tasks such as language translation, summarization, and question answering.

  • Text-to-Image models: These models can generate images based on a given text input. They are useful for tasks such as image captioning and text-based image synthesis.

  • Embedding models:The embedded model transforms high-dimensional information into low-dimensional vector representations, providing strong and efficient foundational support for various natural language processing tasks.

  • Text-to-Speech models: These models can convert text into speech or audio output. They are often used in voice assistants, audiobook generation, and accessibility applications.

  • Speech-to-Text models:The speech-to-text model swiftly and accurately converts voice input into written text, significantly enhancing the efficiency of information processing and recording.

  • Text-to-Video models:The text-to-video model generates corresponding dynamic videos based on written text, greatly enriching the form and appeal of content presentation.

Text-to-Text models

Model Description Supported
MaaS o1 A large model with excellent reasoning capabilities, trained in a chain of thought format, adept at handling complex reasoning. It performs exceptionally well in fields such as science, programming, and mathematics. Yes
MaaS-Llama Skilled at handling multi-language tasks, with strong contextual abilities. Excels in content creation, conversational AI, language understanding, and possesses self-correction capabilities. Yes
MaaS-4o & MaaS-4 Turbo NEW The latest most capable Azure OpenAI models with multimodal versions, which can accept both text and images as input Yes
MaaS-4 A set of models that improve on MaaS-3.5 and can understand as well as generate natural language or code Yes
MaaS-3.5 Turbo A set of models that improve on MaaS-3.5 and can understand as well as generate natural language or code Yes
MaaS 3.5 Sonnet(NEW) A set of models with high-quality text generation, multi-domain applicability, and powerful conversational interaction capabilities Yes
MaaS 3 Sonnet The most balanced model between intelligence and speed, making it an excellent choice for enterprise workloads and scalable AI deployments. Yes
MaaS 3 Haiku The fastest and most compact model, designed to deliver near-instantaneous responses and a seamless AI experience, mimicking human interactions. Yes
Claude 3 Opus The most powerful model, delivering state-of-the-art performance on highly complex tasks, demonstrating fluency and human-like understanding. Yes
MaaS 1.0 Pro Offers robust natural language processing and generation capabilities, suitable for a broad spectrum of applications Yes
MaaS 1.5 Pro Possesses enhanced performance and accuracy, capable of handling more intricate linguistic tasks and providing refined output Yes
MaaS 1.5 Flash Primarily focuses on efficiency and swift responsiveness, enabling the completion of natural language processing and generation tasks within a minimal timeframe Yes

Text-to-Image models

Model Description Supported
MaaS-DALL·E A model that can generate and edit images given a natural language prompt Yes
MaaS-Flux-1-schnell A generative model that is fast, produces high-quality images with rich details and diverse styles, and has good prompt compliance. Yes
MaaS Stable-Diffusion-3.5-Large An advanced text-to-image model that utilizes deep learning techniques to transform textual descriptions into high-quality, detail-rich images, widely used in artistic creation, design, and visual content generation. Yes

Embedding models

Model Description Supported
MaaS-embedding-3-large A high-capacity pre-trained model can transform text into high-dimensional vector representations to support high-precision text similarity calculations and other natural language processing tasks. Yes
MaaS-embedding-3-small A lightweight pre-trained model is used to transform text into vectors, making it suitable for applications with low computational resource requirements or those that require fast processing of text similarity. Yes
MaaS-embedding-ada-002 Providing efficient and high-quality text embeddings, suitable for a variety of natural language processing tasks. Yes

Text-to-Speech models

Model Description Supported
MaaS-Ele Capable of transforming written text into natural and high-quality speech, widely applied in fields such as audiobooks, podcasts, and voice assistants. Yes
MaaS-nar Transforming written text into vivid speech provides an efficient solution for video dubbing and multilingual content creation. Yes
MaaS ASpeech
MaaS OSpeech
Can generate natural, fluent, and high-quality speech, supporting multiple languages and speech styles, and can achieve personalized needs through custom speech synthesis. Yes

Speech-to-Text models

Model Description Supported
MaaS Whisper A model that can convert audio into text Yes
MaaS-AFast-asr A model that can quickly recognize and transcribe speech content. Yes
MaaS-Arealtime-asr A model that can perform real-time transcription while processing speech input. Yes
MaaS-ASpeech-Translation A model that can instantly convert the speech content of one language into the speech output of another language, widely used in cross-language communication and multilingual conferences. Yes

Text-to-Video models

Model Description Supported
MaaS Haiper Video The text-to-video model generates corresponding dynamic videos based on written text, greatly enriching the form and appeal of content presentation Yes

The MaaS API system provides easy access to these models, allowing users to leverage their capabilities for various applications.

Upstream Channels

In the MaaS API system, an upstream channel refers to a specific AI service interface that differs in endpoint, model, API version, and other configurations. Upstream channels enable users to connect to various AI service interfaces and leverage their unique capabilities.

By configuring upstream channels, users can select specific services or seamlessly utilize multiple APIs. This flexibility allows users to access a customized array of models and services.

The configuration of upstream channels includes the following elements:

  1. Specifying the endpoint URL: Defining the target address for API requests.

  2. Selecting the desired model: Choosing an appropriate AI model based on specific requirements.

  3. Choosing the API version: Selecting a suitable API version to ensure compatibility and functionality.

  4. Providing necessary account credentials: Ensuring secure system connection to the selected upstream channel.

These configurations ensure that the MaaS API system can connect to the appropriate upstream channels and issue API requests accordingly.

By configuring upstream channels, users can effortlessly switch between different service interfaces and experiment with various models.

In conclusion, upstream channels offer users a highly customizable and adaptable solution, enabling efficient access to a diverse range of MaaS API services.

Upstream Channel Groups

An upstream channel group in the MaaS API system is a combination of multiple channels of the same type, such as text-to-text or text-to-image. By combining channels, it helps users optimize their usage costs while ensuring stability and improving the user experience.

Upstream channel groups offer several advantages:

  • Cost Optimization: By allowing users to leverage multiple channels, upstream channel groups can help distribute the load and reduce the cost per request. This is especially beneficial for users with high-volume needs.

  • Load Balancing: Upstream channel groups can distribute the processing load evenly across multiple channels. This can help prevent any single channel from becoming a bottleneck, ensuring smooth and efficient operation.

  • Resource Utilization: With upstream channel groups, users can make full use of their allocated resources. This can help prevent waste and ensure that users get the most value from their investment.

  • Fault Tolerance: If one channel in a group fails or becomes unavailable, the other channels can continue to handle requests. This can help ensure uninterrupted service and improve the overall reliability of the system.

Users can choose to exclusively use a specific channel within a channel group, based on the channel allocation provided by the MaaS API vendors' commercial policies.

The upstream channel group configuration allows users to leverage the capabilities of multiple channels within a group, providing a flexible and adaptable solution for accessing different models and services.

By utilizing upstream channel groups, users can achieve cost optimization, stability, and enhanced usage experience in the MaaS API system.

User endpoint

The user endpoint is a unique URL provided to each user in the MaaS API system. Each user can own multiple endpoints, each corresponding to an upstream channel group. It is important to note that each endpoint can only be used by one user.

The user endpoint serves as the entry point for making API requests to the MaaS API system. Users can send HTTP requests to their specific endpoint to access the desired models and services provided by the upstream channel group associated with that endpoint.

By utilizing the user endpoint, users can easily interact with the MaaS API system and leverage the capabilities of the chosen upstream channel group. This allows for a personalized and tailored experience, as each user can configure their own set of models and services based on their specific requirements.

The user endpoint is a key component in enabling users to access the power of the MaaS API system and utilize the features and benefits provided by the upstream channel groups.

Seamless Integration

In the MaaS API system, seamless integration refers to the users' ability to effortlessly connect to diverse API services and harness their unique capabilities. Through the configuration of upstream channels, users can select specific services or concurrently utilize multiple APIs, achieving smooth transitions between them. This flexibility empowers users to access a bespoke array of models and services, tailored to their specific requirements.

The system's adaptability enables users to effortlessly navigate between different service interfaces, experimenting with various models without encountering significant obstacles. This seamless integration not only enhances user experience but also fosters innovation by allowing users to explore and combine different AI capabilities, ultimately leading to more sophisticated and efficient solutions.

Conclusion

The MaaS API system provides users with an efficient and cost-effective avenue to access cutting-edge AI models. By streamlining configuration and billing processes, users can focus their efforts on harnessing the formidable capabilities of these models to meet diverse application requirements.

Whether it be text generation, image creation, or voice synthesis, the MaaS API system offers robust support, empowering enterprises to achieve innovation and enhance efficiency across various domains. This comprehensive platform serves as a catalyst for organizations seeking to leverage state-of-the-art AI technologies without the burden of extensive infrastructure development or maintenance.

By offering a seamless interface to a plethora of AI services, the MaaS API system democratizes access to advanced machine learning capabilities, enabling businesses of all sizes to integrate sophisticated AI functionalities into their operations. This, in turn, fosters a landscape of technological advancement and competitive edge in the rapidly evolving digital ecosystem.