Global EditionASIA 中文雙語Fran?ais
    China
    Home / China / Education

    Guideline to develop AI-backed Chinese language database

    Digitalization of ancient texts promotes cultural heritage, Mandarin learning

    By Zhao Yimeng | China Daily | Updated: 2025-04-01 09:10
    Share
    Share - WeChat

    China is accelerating the digitalization of ancient texts and boosting access to oracle bone script data, aiming to integrate cultural heritage with digital Chinese, officials said on Monday.

    The Ministry of Education, the National Language Commission and the Cyberspace Administration of China issued a guideline to promote the digitalization of the Chinese language and characters. The focus is on developing national language resources and large-scale Chinese language models to support artificial intelligence.

    The guideline aims to establish a national corpus and strategic language resources information database by 2027. By 2035, the country hopes it will have significantly expanded the presence of the Chinese language in global digital and generative AI scenarios.

    Liu Peijun, head of the Department of Language Information Management at the Ministry of Education, said the guideline calls for the digitalization of linguistic and cultural heritage, while promoting the construction of a national digital language and script museum.

    It emphasizes advancing key technologies for ancient text digitalization, enhancing the accessibility of oracle bone script data and launching a multilingual digital education program to facilitate Chinese language learning globally, Liu said at a news conference.

    A key aspect of this initiative is the development of large-scale linguistic data resources. The guideline outlines a plan to build a national corpus with extensive Chinese language datasets to support AI applications.

    Among the pilot projects, Beijing Normal University has launched a large-scale Classical Chinese language model, an AI-driven initiative that sets a new benchmark in the field, Liu said.

    Kang Zhen, vice-president of BNU, said the university has developed a range of digital language databases, including a comprehensive holographic Chinese character database, a digital resource of the ancient Chinese dictionary Shuowen Jiezi, and repositories for ancient inscriptions and handwritten texts.

    These resources have played a crucial role in linguistic research and cultural preservation, Kang added.

    The university's AI Taiyan, a Classical Chinese large language model trained with 1.8 billion parameters, has been designed for high-accuracy interpretation of ancient texts, supporting tasks such as word and phrase explanations, as well as classical-to-modern Chinese translation.

    China is also spearheading the construction of a new national corpus to strengthen linguistic infrastructure in the AI era, said Wang Hui, deputy head of the Ministry of Education's Department of Language Application and Administration.

    "Currently, most linguistic datasets remain limited to single-text formats and specific academic domains, lacking the scale and diversity required for AI applications," Wang said.

    The department has begun planning for the corpus this year, seeking to launch two flagship databases, the Chinese civilization corpus for AI-assisted teaching and research, and the Chinese grand reading system corpus, Wang said.

    Top
    BACK TO THE TOP
    English
    Copyright 1995 - . All rights reserved. The content (including but not limited to text, photo, multimedia information, etc) published in this site belongs to China Daily Information Co (CDIC). Without written authorization from CDIC, such content shall not be republished or used in any form. Note: Browsers with 1024*768 or higher resolution are suggested for this site.
    License for publishing multimedia online 0108263

    Registration Number: 130349
    FOLLOW US
     
    人妻少妇看A偷人无码电影| 日韩人妻无码精品无码中文字幕 | 狠狠躁天天躁无码中文字幕| 最好看的中文字幕2019免费| 中文字幕一区二区精品区| 日韩精品无码免费专区午夜不卡| 无码人妻久久一区二区三区蜜桃| 最近最新免费中文字幕高清| 国产精品无码久久综合| 中文字幕亚洲综合久久| 中文字幕久久精品无码| 乱人伦中文无码视频在线观看| 中文字幕精品久久久久人妻| 精品久久久无码人妻中文字幕| 国产Av激情久久无码天堂| 无码无遮挡又大又爽又黄的视频| 国产高清中文手机在线观看| 亚洲天堂2017无码中文| 国产AV无码专区亚洲AV毛网站| 亚洲综合av永久无码精品一区二区 | 无码精品A∨在线观看| 日韩AV片无码一区二区三区不卡| 暖暖免费在线中文日本| 日本中文字幕在线| 精品无人区无码乱码大片国产| 免费无码VA一区二区三区| 人妻无码一区二区三区免费 | 中文字幕无码一区二区免费| 日韩乱码人妻无码中文字幕视频| 中文字幕久久欲求不满| 中文字幕在线一区二区在线| 精品久久久久中文字幕一区| 亚洲视频中文字幕| 亚洲国产午夜中文字幕精品黄网站 | 亚洲成AV人片天堂网无码| 免费无码午夜福利片69| 精品亚洲成在人线AV无码| 夜夜添无码试看一区二区三区| 亚洲精品无码午夜福利中文字幕| 亚洲国产综合无码一区| 日韩精品无码一区二区三区不卡|