A Test Job is a service that allows users to evaluate AI models using a test suite. It supports automated, large-scale testing and provides performance metrics to assess model quality before deployment.
Test Jobs: automated, repeatable, and scalable testing using a test suite.
Interactive Sessions: manual, real-time interactions for quick checks or demos.
Anyone who needs to:
Validate model performance at scale
Compare different model versions.
Generate quantitative evaluation metrics.
Ensure model robustness before production deployment.
LLM (Large Language Models): Text-only input
VLM (Vision-Language Models): Text and image input
Models must be instruction-tuned and have all required files uploaded.
Alpaca: Instruction following format.
ShareGPT: Multi-turn conversations.
ShareGPT_Image: Multimodal conversations with images.
Ensure your account has sufficient balance.
Check that your model and dataset meet the required format.
Contact support via Hotline: 1900 638 399 or [email protected]