来源:统计学院

7月22日 | Linlong Kong : Debiasing with Sufficient Projection: A General Theoretical Framework for Vector Representations

来源:统计学院发布时间:2024-07-20浏览次数:10

时   间:2024年7月22日15:30 - 16:30

地   点:普陀校区理科大楼A1514

报告人:Linglong Kong 加拿大阿尔伯塔大学教授

主持人:谌自奇华东师范大学研究员

摘   要:

Pre-trained vector representations in natural language processing often inadvertently encode undesirable social biases. Identifying and removing unwanted biased information from vector representation is an evolving and significant challenge. Our study uniquely addresses this issue from the perspective of statistical independence, proposing a framework for reducing bias by transforming vector representations to an unbiased subspace using sufficient projection. The key to our framework lies in its generality: it adeptly mitigates bias across both debiasing and fairness tasks, and across various vector representation types, including word embeddings and output representations of transformer models. Importantly, we establish the connection between debiasing and fairness, offering theoretical guarantees and elucidating our algorithm's efficacy. Through extensive evaluation of intrinsic and extrinsic metrics, our method achieves superior performance in bias reduction while maintaining high task performance, and offers superior computational efficiency.

报告人简介:

Dr. Linglong Kong is a professor in the Department of Mathematical and Statistical Sciences at the University of Alberta. He holds a Canada Research Chair in Statistical Learning and a Canada CIFAR AI Chair. He is a fellow of American Statistical Association (ASA) and a fellow of the Alberta Machine Intelligence Institute (AMII). His publication record includes more than 100 peer-reviewed articles in top journals such as AOS, JASA and JRSSB as well as top conferences such as NeurIPS, ICML, ICDM, AAAI, and IJCAI. Dr. Kong currently serves as associate editor of the Journal of the American Statistical Association, the Annals of Applied Statistics, the Canadian Journal of Statistics, and Statistics and its Interface. Additionally, Dr. Kong was a member of the Executive Committee of the Western North American Region of the International Biometric Society, chair of the ASA Statistical Computing Session program, and chair of the webinar committee. He served as a guest editor of Canadian Journal of Statistics and Statistics and its Interface, associate editor of International Journal of Imaging Systems and Technology, guest associate editor of Frontiers of Neurosciences, chair of the ASA Statistical Imaging Session, and member of the Statistics Society of Canada's Board of Directors. He is interested in the analysis of high-dimensional and neuroimaging data, statistical machine learning, robust statistics and quantile regression, as well as artificial intelligence for smart health.