Stars
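First-place solution for CGED-8, Track 2 of the CCL 2022 Chinese Learner Text Correction evaluation task
An introductory LLM tutorial for developers; the Chinese edition of Andrew Ng's large language model course series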
[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction
ChatGLM-6B: An Open Bilingual Dialogue Language Model
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
CCKS 2022 Task 9, Subtask 2: identical-product identification
[MIR-2023-Survey] A continuously updated paper list for multi-modal pre-trained big models
Unified Structure Generation for Universal Information Extraction
Baselines for CCKS 2022 Task "Product Knowledge Graph Alignment"
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
Source code for AAAI 2022 paper: Unified Named Entity Recognition as Word-Word Relation Classification
Fast implementation of BERT inference directly on NVIDIA (CUDA, CUBLAS) and Intel MKL
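A collection of good articles I have come across in my day-to-day reading, kept for my own reference and shared with everyone. Articles I have read get a brief commentary; those I have not read yet are simply listed for now, with commentary added once I read them. Topics cover search, recommendation, and natural language processing.
A summary of the knowledge a natural language processing (NLP) engineer needs to build up, including interview questions, fundamentals, and engineering skills, to strengthen core competitiveness
Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)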
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search…
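Ansj word segmentation: a true Java implementation of ICT. Its segmentation quality and speed both exceed those of the open-source version of ICT. Chinese word segmentation, person-name recognition, part-of-speech tagging, user-defined dictionaries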
Chinese version of GPT2 training code, using BERT tokenizer.
Mainly stores materials for the "data mining / machine learning" track of Datawhale team learning.
Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.
Familia-Visualization is a demo application for Baidu Familia.