大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
-
Updated
May 23, 2024
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
An Implementation of 'Attention is all you need' with Chinese Corpus
汉语现代诗歌语料库整理,3489诗人,81.7K诗歌,15.43M字。持续扩充...
Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard
基于4-tag标注好的2019中文维基语料库,使用hanlp进行标注
A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型
20201124到20220710期间的微博热搜中出现过的姓名 (主要为明星、政客、名人、网红、企业家等)
Corpus creator for Chinese Wikipedia
PTT 八卦版問答中文語料
Predicting Audience’s Response from Sketch Comedy and Crosstalk Scripts (A Corpus Supporting Comedy Writers)
Pretrained model for Chinese Scientific Text
搜狗细胞词库到普通文本的转换提取工具。提取词汇表,用于深度学习做数据生成和字典特征
Pre-trained Wikipedia corpus by MITIE
Add a description, image, and links to the chinese-corpus topic page so that developers can more easily learn about it.
To associate your repository with the chinese-corpus topic, visit your repo's landing page and select "manage topics."