From the Detection of Toxic Spans in Online Discussions to the Analysis of Toxic-to-Civil Transfer ACL22
HateXplain(HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection AAAI21)
e-CARE(e-CARE: a New Dataset for Exploring Explainable Causal Reasoning ACL22)
PROTOTEX: Explaining Model Decisions with Prototype Tensors ACL22
MixGEN(Explaining Toxic Text via Knowledge Enhanced Text Generation NAACL22)
Teach Me to Explain: A Review of Datasets for Explainable Natural Language Processin(NeurIPS 2021)
Alibi Explain: Algorithms for Explaining Machine Learning Models
Learning to Explain: Generating Stable Explanations Fast
SOCIAL BIAS FRAMES: Reasoning about Social and Power Implications of Language