JAIST's ambitious project to build a premier Japanese LLM required a partner that could provide not just raw computing power, but also a sophisticated platform to manage the entire model development lifecycle. FPT AI Factory, with its integrated FPT AI Studio and FPT AI Inference services, provided the end-to-end solution JAIST needed.
The collaboration began with a systematic search for the most effective training data combination. Using FPT AI Studio, JAIST’s researchers trained the Qwen3-0.6B model across 768 unique training data combinations, equivalent to 768 separate training runs. This critical phase was also accelerated by utilizing FPT AI Inference’s embedding models to analyze and classify text domains within the mixed training data.
Once the ideal data combination was identified, JAIST embarked on a massive continual pre-training effort using the Qwen2.5-32B as the base model. This process was broken down into three distinct, computationally intensive phases, all managed within FPT AI Studio:
Throughout this complex process, FPT AI Factory's engineers provided close, dedicated support, ensuring the seamless execution of these large-scale training jobs.
For evaluation, JAIST utilized the full capabilities of FPT AI Studio. The continually pretrained models underwent LoRA fine-tuning and were rigorously benchmarked against the Nejumi Leaderboard 3 using the Test Jobs feature. Furthermore, the Interactive Session feature allowed JAIST researchers to serve the fine-tuned models and conduct their own internal, custom benchmarks.