LA-paper-report-talk-etc

来自SUDA-HLT
Zhli讨论 | 贡献2025年11月7日 (五) 02:06的版本 →‎Papers in English
跳到导航 跳到搜索

请大家按照规则不断完善此页面。包含pdf ppt codes等。论文按年度,先中文、再英文;先期刊、再会议;先录用时间、后发表(开会)时间。尽量用英文吧。

文献格式和内容与内部wiki保持一致(尽量避免重复劳动)

  • pdf ppt等附件都可以放到外网可以访问的地方,放一个地方,避免重复
  • 基金号等信息,这个页面上删掉,不要写

bib and abstract

bib

摘要和基金信息请看内部wiki


Talks(报告)

  • 视频尽可能都放在了哔哩哔哩上了(搜用户名:LAGroup)
  • 李正华2025年4月19日受邀做报告:句法分析研究进展和思考
  • 李正华2024年11月24日受南师大李斌老师邀请做报告:大模型时代如何做研究:一些思考(文本纠错任务上的大模型相关工作,作为例子)
  • 李正华2023年4月8日受北语杨天麟老师邀请做报告:大模型时代句法语义研究何去何从
  • 李正华2022年11月29日受复旦邱锡鹏老师邀请做报告:基于适配句法知识的文本纠错
  • 李正华2022年11月11日受邀对COLING2022 Best Paper做英文报告
  • 2022.8.27:江苏省人工智能大会(报告题目:汉语文本纠错近年进展:数据集和模型)
  • 2021.7.27:《数据标注师资培训》(哈工大大数据集团)
  • 2019年9月28日句法标注培训(LA组介绍):

Competition or Shared Tasks

  • 刘亚慧、乔子恒、李正华、龚晨、张民. 2025.8. CCL-2025 第三届汉语框架语义解析评测, 二等奖 比赛榜单评测结果


  • 周厚全,乔子恒,蒋浩辰,刘雨萌. 金山办公2024算法挑战赛-中文文本智能校对大赛,第一名,一等奖
  • 王学彬, 李正华. 2024.5. 第一届古汉语断句标点评测(EvaHan2024)评测,二等奖*(COLING-2024 workshop)


  • 辜仰淦,周仕林,李正华. 2023年8月. CCL中文抽象语义表示解析评测(一等奖)官网评测报告
  • 蒋浩辰,刘雨萌,周厚全,乔子恒,章波,李辰,李正华,张民. CCL汉语学习者文本纠错评测(封闭、开放双赛道第一). 2023年8月. S&A-CCL2023评测报告.pdf


  • 2022.12法研杯第一名 CAIL-2022 法律文本纠错
  • 我们组织了两届跨领域句法分析评测:CCL-2021和NLPCC-2019

Awards

  • 2024. 第三届全国大模型智能生成大会(CIPS-LMG 2024)优秀海报奖(EMNLP-2024论文)
  • 2023. CCF-NLPCC“青年新锐学者” (Young Outstanding Scientist Award)
  • 2022. SudaNLP团队张宇同学的硕士论文《基于树形条件随机场的高阶句法分析》被评选为2022年度江苏省优秀学术型硕士学位论文 wechat-sudanlp-news
  • 2022. Coling best paper
  • 2021. 章波,江苏省优秀学术型硕士论文;同时获江苏省计算机学会优秀硕士论文《面向依存句法的树库转化与应用研究》
  • 2020. NLPCC best paper


同学必须深入学习的东西

LAGroup 同学必须深入学习的东西 慢慢完善

2025

Journal/Conference Papers in Chinese

Papers in English

  • Yahui Liu, Zhenghua Li, Chen Gong*, Shilin Zhou,Min Zhang. Annotation error detection in painstakingly annotated data: Part-of-speech tagging as a case study. Expert Systems With Applications (ESWA), 2025,290:128374. official [Journal]
  • Ziyan Zhang, Yang Hou, Chen Gong*, Zhenghua Li. Self-Correction Makes LLMs Better Parsers. In Findings of EMNLP 2025, Suzhou, China. Arxiv.
  • Houquan Zhou, Bo Zhang, Zhenghua Li*, Ming Yan, and Min Zhang. 2025. A Training-free LLM-based Approach to General Chinese Character Error Correction. In Proceedings of ACL, pages 13827–13852, Vienna, Austria. Association for Computational Linguistics. official July 27 - August 1
  • Ziheng Qiao, Houquan Zhou, Zhenghua Li*. Mixture of Small and Large Models for Chinese Spelling Check. ACL, pages 28298–28311, Vienna, Austria. acl-anthology
  • Ziheng Qiao, Houquan Zhou, Yumeng Liu, Zhenghua Li*, Min Zhang, Bo Zhang, Chen Li, Ji Zhang, Fei Huang. DISC: Plug-and-Play Decoding Intervention with Similarity of Characters for Chinese Spelling Check. ACL, pages 28312–28324, Vienna, Austria. acl-anthology
  • Yang Hou, Zhenghua Li*. Dynamic Head Selection for Neural Lexicalized Constituency Parsing. In Proceedings of ACL 2025, pages 16141–16155, Vienna, Austria. acl-anthology
  • Yang Hou, Zhenghua Li*. Span-based Semantic Role Labeling as Lexicalized Constituency Tree Parsing. In Findings of ACL 2025, pages 10701–10713, Vienna, Austria. acl-anthology
  • Yanggan Gu, Junzhuo Li, Sirui Huang, Xin Zou, Zhenghua Li*, Xuming Hu*. Capturing Nuanced Preferences: Preference-Aligned Distillation for Small Language Models. In Findings of ACL 2025, pages 15959–15973, Vienna, Austria. acl-anthology

2024

Journal/Conference Papers in Chinese

Conference Papers in English

  • Houquan Zhou, Zhenghua Li*, Bo Zhang, Chen Li, Shaopeng Lai, Ji Zhang, Fei Huang, Min Zhang. 2024. A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models. EMNLP. pp 17446–17467. arxiv acl-anthology November 12-16 Miami, Florida (USA)
  • Xuebin Wang, Zhenghua Li*. 2024. Two Sequence Labeling Approaches to Sentence Segmentation and Punctuation Prediction for Classic Chinese Texts. Acl-anthology the Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA) @ LREC-COLING-2024. pages 237–241 25 May, 2024 Torino (Italia) Turin (Italy)

20-25 May, 2024. Torino (Italia) Turin (Italy)

2023

Conference Papers in English

  • Houquan Zhou, Yumeng Liu, Zhenghua Li*, Min Zhang, Bo Zhang, Chen Li, Ji Zhang, Fei Huang. 2023. Improving Seq2Seq Grammatical Error Correction via Decoding Interventions. Findings of EMNLP, pages 7393–7405. arxivcodecitation
  • Yue Zhang, Leyang Cui, Enbo Zhao, Wei Bi, Shuming Shi. 2023. RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation. EMNLP 2023.arxiv code pages 16780–16793; December 6-10, 2023
  • Yue Zhang, Bo Zhang, Haochen Jiang, Zhenghua Li*, Chen Li, Fei Huang, Min Zhang. 2023. NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts. Findings of ACL 2023. pp 9935-9951. official arxiv code

2022

Journal/Conference Papers in Chinese

Journal Papers in English

  • Chen Gong, Zhenghua Li* and Min Zhang. Neural Coupled Sequence Labeling for Heterogeneous Annotation Conversion. IEEE/ACM Transactions on Audio, Speech and Language Processing, 2022, 30:1624-1636.official

Conference Papers In English

  • Yue Zhang, Bo Zhang, Zhenghua Li*, Zuyi Bao, Chen Li, and Min Zhang. 2022. SynGEC: Syntax-Enhanced Grammatical Error Correction with a Tailored GEC-Oriented Parser. In EMNLP, pages 2518–2531. arxivofficial pdfvideo
  • Yu Zhang, Qingrong Xia, Shilin Zhou, Yong Jiang, Guohong Fu*, Min Zhang. 2022. Semantic Role Labeling as Dependency Parsing: Exploring Latent Tree Structures inside Arguments. In COLING, pages 4212–4227. arxiv official pdf video
  • Shilin Zhou, Qingrong Xia, Zhenghua Li*, Yu Zhang, Yu Hong, and Min Zhang. 2022. Fast and Accurate End-to-End Span-based Semantic Role Labeling as Word-based Graph Parsing. In COLING, pages 4160–4171. arxiv official pdf video (best paper!)
  • Yahui Liu, Haoping Yang, Chen Gong*, Qingrong Xia, Zhenghua Li, Min Zhang. 2022. MuCPAD: A Multi-Domain Chinese Predicate-Argument Dataset. In NAACL, pages 1707-1717. arxivofficial pdfvideo
  • Yue Zhang, Zhenghua Li*, Zuyi Bao, Jiacheng Li, Bo Zhang, Chen Li, Fei Huang, Min Zhang. 2022. MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction. In NAACL, pages 3118–3130. arxivofficial pdfvideo
  • Ying Li, Shuaike Li, Min Zhang. 2022. Semi-supervised Domain Adaptation for Dependency Parsing with Dynamic Matching Network. In 'ACL 2022, pages 1035--1045. official-pdf camera ready pdf video.
  • Houquan Zhou, Yang Li, Zhenghua Li, and Min Zhang. 2022. Bridging Pre-trained Language Models and Hand-crafted Features for Unsupervised POS Tagging. In Findings of the Association for Computational Linguistics: ACL 2022, pages 3276–3290, Dublin, Ireland. Association for Computational Linguistics. pdf camera ready pdf official pdf video

2021

Journal/Conference Papers in Chinese

Conference Papers In English

  • Yang Hou, Houquan Zhou, Zhenghua Li*, Yu Zhang, Min Zhang, Zhefeng Wang, Baoxing Huai, Nicholas Jing Yuan. A Coarse-to-Fine Labeling Framework for Joint Word Segmentation, POS Tagging, and Constituent Parsing. Proceedings of CoNLL-2021, pp. 290–299. Punta Cana, Dominican Republic (Online), 10-11 Nov. 2021. 文件:CoNLL 2021 yhou official version.pdf
  • Ying Li, Meishan Zhang, Zhenghua Li*, Min Zhang, Zhefeng Wang, Baoxing Huai, Nicholas Jing Yuan. APGN: Adversarial and Parameter Generation Networks for Multi-Source Cross-Domain Dependency Parsing. Proceedings of EMNLP-2021 Findings, pp. 1727–1733. Punta Cana, Dominican Republic (Online), 7-11 Nov. 2021. 文件:EMNLP 2021 yli camera ready.pdf
  • Chen Gong, Saihao Huang, Houquan Zhou, Zhenghua Li*, Min Zhang, Zhefeng Wang, Baoxing Huai, Nicholas Jing Yuan. An In-depth Study on Internal Structure of Chinese Words. Proceedings of ACL-2021, pp. 5823–5833. Online, Virtual Event, 1-6 Aug. 2021. 文件:2021.acl-long.452.pdf
  • Qingrong Xia, Bo Zhang, Rui Wang, Zhenghua Li*, Yue Zhang, Fei Huang, Luo Si, Min Zhang. 2021. A Unified Span-Based Approach for Opinion Mining with Syntactic Constituents. Proceedings of NAACL-2021, pp. 1795-1804. Mexico City, Mexico (Online), 6-11 June. 2021. 文件:2021.naacl-main.144.pdf

2020

Journal Papers in Chinese

Journal Papers In English

Conference Papers In English

  • Ying Li, Zhenghua Li* and Min Zhang. Semi-supervised Domain Adaptation for Dependency Parsing via Improved Contextualized Word Representations. Proceedings of COLING-2020. pp. 3806–3817. Barcelona, Spain (Online), 8-13 Dec. 2020.pdf 文件:Liying-2020.coling-main.338.pdf
  • Chen Gong, Zhenghua Li*, Bowei Zou and Min Zhang. Multi-grained Chinese Word Segmentation with Weakly Labeled Data. Proceedings of COLING-2020. pp. 2026–2036. Barcelona, Spain (Online), 8-13 Dec. 2020. pdf文件:Gongchen-2020.coling-main.183.pdf
  • Lijie Wang, Ao Zhang, Kun Wu, Ke Sun, Zhenghua Li, Hua Wu, Min Zhang and Haifeng Wang. DuSQL: A Large-Scale and Pragmatic Chinese Text-to-SQL Dataset. Proceedings of EMNLP-2020. pp. 6923-6935. Online, 16-20 Nov. 2020. pdf文件:2020.emnlp-main.562.pdf
  • Houquan Zhou, Yu Zhang, Zhenghua Li, and Min Zhang. Is POS Tagging Necessary or Even Helpful for Neural Dependency Parsing? Proceedings of NLPCC-2020, pp. 179--191. Zhengzhou, China, 14 Oct. - 18 Oct. 2020. (pdf) (pdf-official) (pdf-official-w-content) (best paper!)
  • Yu Zhao, Mingyue Zhou, Zhenghua Li, and Min Zhang. Dependency Parsing with Noisy Multi-Annotation Data. Proceedings of NLPCC-2020, pp. 120-131. Zhengzhou, China, 14 Oct. - 18 Oct. 2020. (pdf)(pdf-official) (pdf-official-w-content)
  • Yu Zhang, Zhenghua Li, Min Zhang. 2020. Efficient Second-Order TreeCRF for Neural Dependency Parsing. Proceedings of ACL-2020, pp. 3295-3305. Seattle, America, 5-10 Jul. 2020. [pdf-official] [pdf] [video]
    • A very good paper: Timothy Dozat, Christopher D. Manning. ICLR-2017. Deep Biaffine Attention for Neural Dependency Parsing. arxiv
  • Bo Zhang, Yue Zhang, Rui Wang, Zhenghua Li, Min Zhang. Syntax-Aware Opinion Role Labeling with Dependency Graph Convolutional Networks. Proceedings of ACL-2020, pp. 3249-3258. Seattle, America, 5-10 Jul. 2020. pdf 文件:Zhangbo-acl2020-ppt-5-24.pdf

before 2020

  • Meishan Zhang, Zhenghua Li, Guohong Fu and Min Zhang. Syntax-Enhanced Neural Machine Translation with Syntax-Aware Word Representations. In Proceedings of the NAACL-2019. pp. 1151–1161. Seattle, America, 5-10 Jun. 2019. pdf
  • Bowen Wu, Jiayuan Chao, Baoxun Wang, Zhenghua Li and Min Zhang. Abstractive Summarization via Continuous Copy. EMNLP-2019 Workshop Summarization Submission. Aug 20, 2019. (not accepted)

更早的论文,请点击此

202? Template

Talks and Misc.

Journal/Conference Papers in Chinese

Journal Papers In English

Conference Papers In English