
XiChen: An observation-scalable fully AI-driven global weather forecasting system with 4D variational knowledgeAuthors: Wuxin Wang, Weicheng Ni, Lilan Huang, Tao Han, Ben Fei, Shuo Ma, Taikang Yuan, Yanlai Zhao, Kefeng Deng, Xiaoyong Li, Boheng Duan, Lei Bai, Kaijun Ren
XiChen is the first observation-scalable fully AI-driven global weather forecasting system. Its entire pipeline, from Data Assimilation (DA) to 10-day medium-range forecasting, can be accomplished within only 17 seconds using a single A100 GPU. This speed represents an acceleration exceeding 400-fold compared to the computational time required by operational Numerical Weather Prediction (NWP) systems.
The system is architected upon a foundation model that is initially pre-trained for weather forecasting and subsequently fine-tuned to function as both observation operators and DA models. Crucially, the integration of four-dimensional variational (4DVar) knowledge ensures that XiChen’s DA and medium-range forecasting accuracy rivals that of operational NWP systems.
XiChen demonstrates high scalability and robustness by employing a cascaded sequential DA framework to effectively assimilate both conventional observations (GDAS prepbufr) and raw satellite observations (AMSU-A and MHS). This design allows for the future integration of new observations simply by fine-tuning the respective observation operators and DA model components, which is critical for operational deployment.
In terms of performance, XiChen achieves a skillful weather forecasting lead time exceeding 8.25 days (with ACC of Z500 > 0.6). This result is comparable to the Global Forecasting System (GFS) and substantially surpasses the performance of other end-to-end AI-based global weather forecasting systems, such as Aardvark (less than 8 days) and GraphDOP (about 5 days).
A dual DA framework is implemented to operationalize XiChen as a continuous forecasting system. This framework utilizes separate 12-hour and 3-hour Data Assimilation Windows (DAW) to circumvent the multi-hour latency characteristic of high-resolution systems (like IFS HRES), thereby enabling the real-time acquisition of medium-range forecast products.