About me

Hi, I am Jinwon Sohn from South Korea and currently I am a fourth-year Ph.D. candidate in Statistics at Purdue University. Before entering the Ph.D. program, I worked at Datarize as a data scientist, contributing to evaluating recommendation models. During the Ph.D. program, my research has focused on integrating fairness-aware machine learning and generative modeling from statistical perspectives under the supervision of Professor Qifan Song. Plus, with active collaboration research, I have been expanding my research areas such as differential privacy and principal curve. Including the aforementioned topics, I am also interested in developing statistical methodologies particularly for synthetic data ultimately toward trustworthy statistical inference. I always welcome collaboration research. Feel free to reach out!

Research Interests 💡

  • Fairness-aware Machine Learning, Generative Modeling, Bayesian Statistics, Differential Privacy, Principal Curve

News ⭐

  • [1-14-2025] Lim, T., Nam, K., and Sohn, J. Monotone Curve Estimation with Convex Duality is now available at arxiv.

  • [11-21-2024] Jinwon Sohn will present a talk at Graduate Student Workshop hosted by Statistics at Purdue. Title: Parallelly Tempered Generative Adversarial Networks.

  • [11-19-2024] Sohn, J. and Song, Q. Parallelly Tempered Generative Adversarial Networks.. This is under revision for the special issue (Statistical Science in Artificial Intelligence) in Journal of American Statistical Association.

  • [05-03-2024] Jinwon Sohn is selected for 2024-2025 Ross Lynn Research Scholar Grant for Statistics at Purdue University.

Recent Publications 📚

  • Sohn, J., Song, Q., & Lin, G. (2024). Fair Supervised Learning with A Simple Random Sampler of Sensitive Attributes. AISTAT (pp. 1594-1602). PMLR.
  • Kang, T., Kim, S., Sohn, J., & Awan, J. (2024). Differentially Private Topological Data Analysis. Journal of Machine Learning Research.
  • Sohn, J., Jeong, S., Cho, Y. M., & Park, T. (2023). Functional Clustering Methods for Binary Longitudinal Data with Temporal Heterogeneity. Computational Statistics & Data Analysis, 185, 107766.