Skip to main content
Xuan Wang

Xuan Wang, PhD

Academic Information

Departments Primary - Population Health Sciences

Academic Office Information

xuan.wang@utah.edu

Research Statement

My research interests include statistical methods for surrogate validation, causal inference and missing data analysis, complex survival data analysis, supervised learning, semi-supervised learning, federated transfer learning, etc. Meanwhile, I make great effect in applying these noval statistical methods to analyze real world data, especially electronic health records (EHR) data. For a full list of my publications, please see

https://scholar.google.com/citations?hl=en&user=sH8TVSoAAAAJ&view_op=list_works&sortby=pubdate

https://www.researchgate.net/profile/Xuan-Wang-96

Education History

Undergraduate Beijing Jiaotong University
BS
Doctoral Training Academy of Mathematics and Systems Science of the Chinese Academy of Sciences
PhD
Postdoctoral Fellowship University of Washington
Postdoctoral Fellow
Postdoctoral Fellowship Harvard University
Postdoctoral Fellow

Selected Publications

Journal Article

  1. Wang X, Claggett BL, Tian L, Malachias MVB, Pfeffer MA, Wei L (2023). Quantifying and Interpreting the Prediction Accuracy of Models for the Time of a Cardiovascular Event-Moving Beyond C Statistic: A Review. JAMA cardiology, 8(3), 290-295.
  2. Hou J, Chan SF, Wang X, Cai (2023). Risk prediction with imperfect survival outcome information from electronic health records. Biometrics, 79(1), 190-202.
  3. Wang X, et al (2022). SurvMaximin: Robust federated approach to transporting survival risk prediction models. Journal of biomedical informatics, 134, 104176.
  4. Wang X, Parast L, Han L, Tian L, Cai (2023). Robust approach to combining multiple markers to improve surrogacy. Biometrics, 79(2), 788-798.
  5. Wang X, Zheng Y, Jensen MK, He Z, Cai (2021). Biomarker evaluation under imperfect nested case-control design. Statistics in medicine, 40(18), 4035-4052.
  6. Chan S, Wang X, Jazi¿ I, Peskoe S, Zheng Y, Cai (2021). Developing and evaluating risk prediction models with panel current status data. Biometrics, 77(2), 599-609.
  7. Wang X, Parast L, Tian LU, Cai (2020). Model-free approach to quantifying the proportion of treatment effect explained by a surrogate marker. Biometrika, 107(1), 107-122.
  8. Wang X, Panickan VA, Cai T, Xiong X, Cho K, Cai T, Bourgeois FT (2023). Endovascular aneurysm repair devices as a use case for postmarketing surveillance of medical devices. JAMA internal medicine, 183(10), 1090-1097.
  9. Wang X, Kim DH, Wei L (2021). Quantifying and Interpreting Efficacy of Reduced-Intensity Chemotherapy With Oxaliplatin and Capecitabine on Cancer Control for Advanced Gastroesophageal Cancer Among an Older Population. JAMA oncology, 7(11),
  10. Chen AW, Hong C, Ho YL, Link N, Honerlaw JP, Tanukonda V, Orkaby AR, Qazi S, Melley C, Galloway A, Costa L, Maripuri M, Wang X, Zhang Y, Schubert P, Cai T, He Z, Panickan VA, Rosser M, Tarko L, Dowell S, Feldman C, Kerr G, Gaziano JM, Wilson PWF, Cho K, Cai T, Liao K (2026). Improving classification of myocardial infarction with machine learning in a diverse population. American journal of epidemiology, 195(3), 841-849.
  11. Wang X, Cai T, Tian L, Parast (2025). Model-Free Approach to Evaluate a Censored Intermediate Outcome as a Surrogate for Overall Survival. Statistics in medicine, 44(20-22), e70268.
  12. Wang X, Zhou J, Parast L, Greene (2025). Semiparametric joint modeling to estimate the treatment effect on a longitudinal surrogate with application to chronic kidney disease trials. Biometrics, 81(3),
  13. Hutch MR, Son J, Le TT, Hong C, Wang X, Shakeri Hossein Abad Z, Morris M, Gutiérrez-Sacristán A, Klann JG, Spiridou A, Batugo A, Bellazzi R, Benoit V, Bonzel CL, Bryant WA, Chiudinelli L, Cho K, Das P, González González T, Hanauer DA, Henderson DW, Ho YL, Loh NHW, Makoudjou A, Makwana S, Malovini A, Moal B, Mowery DL, Neuraz A, Samayamuthu MJ, Sanz Vidorreta FJ, Schriver ER, Schubert P, Talbert J, Tan ALM, Tan BWL, Tan BWQ, Tibollo V, Tippman P, Verdy G, Yuan W, Avillach P, Gehlenborg N, Omenn GS, Consortium for Clinical Characterization of COVID-19 by EHR (4CE), Visweswaran S, Cai T, Luo Y, Xia (2024). Neurological diagnoses in hospitalized COVID-19 patients associated with adverse outcomes: A multinational cohort study. PLOS digital health, 3(4), e0000484.
  14. Akcicek EY, Hashemizadeh K, Akcicek H, Kim SE, Hadley JR, Roberts J, Wang X, Guo Y, Balu N, McNally JS, Parker DL, Yuan C, Ma (2025). Qualitative and quantitative reproducibility of 3D MERGE and SNAP sequences for carotid vessel wall imaging across Siemens and Philips 3T scanners. Quantitative imaging in medicine and surgery, 15(4), 3111-3122.
  15. Wang X, Plantinga AM, Xiong X, Cromer SJ, Bonzel CL, Panickan V, Duan R, Hou J, Cai (2024). Comparing Insulin Against Glucagon-Like Peptide-1 Receptor Agonists, Dipeptidyl Peptidase-4 Inhibitors, and Sodium-Glucose Cotransporter 2 Inhibitors on 5-Year Incident Heart Failure Risk for Patients With Type 2 Diabetes Mellitus: Real-World Evidence Study Using Insurance Claims. JMIR diabetes, 9, e58137.
  16. Verma A, Huffman JE, Rodriguez A, Conery M, Liu M, Ho YL, Kim Y, Heise DA, Guare L, Panickan VA, Garcon H, Linares F, Costa L, Goethert I, Tipton R, Honerlaw J, Davies L, Whitbourne S, Cohen J, Posner DC, Sangar R, Murray M, Wang X, Dochtermann DR, Devineni P, Shi Y, Nandi TN, Assimes TL, Brunette CA, Carroll RJ, Clifford R, Duvall S, Gelernter J, Hung A, Iyengar SK, Joseph J, Kember R, Kranzler H, Kripke CM, Levey D, Luoh SW, Merritt VC, Overstreet C, Deak JD, Grant SFA, Polimanti R, Roussos P, Shakt G, Sun YV, Tsao N, Venkatesh S, Voloudakis G, Justice A, Begoli E, Ramoni R, Tourassi G, Pyarajan S, Tsao P, O'Donnell CJ, Muralidhar S, Moser J, Casas JP, Bick AG, Zhou W, Cai T, Voight BF, Cho K, Gaziano JM, Madduri RK, Damrauer S, Liao K (2024). Diversity and scale: Genetic architecture of 2068 traits in the VA Million Veteran Program. Science (New York, N.Y.), 385(6706), eadj1182.
  17. Maripuri M, Dey A, Honerlaw J, Hong C, Ho YL, Tanukonda V, Chen AW, Panickan VA, Wang X, Zhang HG, Yang D, Samayamuthu MJ, Morris M, Visweswaran S, Beaulieu-Jones B, Ramoni R, Muralidhar S, Gaziano JM, Liao K, Xia Z, Brat GA, Cai T, Cho (2024). Characterization of Post-COVID-19 Definitions and Clinical Coding Practices: Longitudinal Study. Online journal of public health informatics, 16, e53445.
  18. Wang X, Liu M, Nogues IE, Chen T, Xiong X, Bonzel CL, Zhang H, Hong C, Xia Y, Dahal K, Costa L, Cui J, VA Million Veteran Program, Gaziano JM, Kim SC, Ho YL, Cho K, Cai T, Liao K (2024). Heterogeneous associations between interleukin-6 receptor variants and phenotypes across ancestries and implications for therapy. Scientific reports, 14(1), 8021.
  19. Wang L, Wang X, Liao KP, Cai (2024). Semisupervised transfer learning for evaluation of model classification performance. Biometrics, 80(1),
  20. Wang X, Ayakulangara Panickan V, Cai T, Xiong X, Cho K, Cai T, Bourgeois F (2023). Endovascular Aneurysm Repair Devices as a Use Case for Postmarketing Surveillance of Medical Devices. JAMA internal medicine, 183(10), 1090-1097.
  21. Wang X, Claggett BL, Tian (2023). Use the Receiver Operating Characteristic to Assess Model Accuracy-Reply. JAMA cardiology, 8(10), 998-999.
  22. Sperotto F, Gutiérrez-Sacristán A, Makwana S, Li X, Rofeberg VN, Cai T, Bourgeois FT, Omenn GS, Hanauer DA, Sáez C, Bonzel CL, Bucholz E, Dionne A, Elias MD, García-Barrio N, González TG, Issitt RW, Kernan KF, Laird-Gion J, Maidlow SE, Mandl KD, Ahooyi TM, Moraleda C, Morris M, Moshal KL, Pedrera-Jiménez M, Shah MA, South AM, Spiridou A, Taylor DM, Verdy G, Visweswaran S, Wang X, Xia Z, Zachariasse JM, Consortium for Clinical Characterization of COVID-19 by EHR (4CE), Newburger JW, Avillach (2023). Clinical phenotypes and outcomes in children with multisystem inflammatory syndrome across SARS-CoV-2 variant eras: a multinational study from the 4CE consortium. EClinicalMedicine, 64, 102212.
  23. Hong, C., Wen, J., Zhang, H. G., Ayakulangara Panickan, V., Yang, D. Y., Chen, A. W., . . . other (2025). Label efficient phenotyping for long covid using electronic health records. Digital Medicine, 8(1), 405.
  24. Wang X., Parast, L., Tian, L., & Cai, T (2025). Towards optimal use of surrogate markers to improve power. Sinica,
  25. Dagliati, A., Strasser, Z. H., Abad, Z. S. H., Klann, J. G.,Wagholikar, K. B., Mesa, R., . . . other (2023). Characterization of long covid temporal sub-phenotypes by distributed representation learning from electronic health record data: a cohort study. Eclinicalmedicine, 64,
  26. Tan, A. L., Getzen, E. J., Hutch, M. R., Strasser, Z. H., Guti´errez-Sacrist´an, A., Le, T. T., . . . other (2023). Informative missingness: What can we learn from patterns in missing laboratory data in the electronic health record?. Journal of biomedical informatics, 139, 104306.
  27. Zhang, H. G., Honerlaw, J. P., Maripuri, M., Samayamuthu, M. J., Beaulieu-Jones, B. R., Baig, H. S., . . . other (2023). Potential pitfalls in the use of real-world data for studying long covid. Nature medicine, 29(5), 1040-1043.
  28. Moal, B., Orieux, A., Fert´e, T., Neuraz, A., Brat, G. A., Avillach, P., . . . other (2023). Acute respiratory distress syndrome after sars-cov-2 infection on young adult population: International observational federated study based on electronic health records through the 4ce consortium. Plos one, 18(1), e0266985.
  29. Tan, B. W., Tan, B. W., Tan, A. L., Schriver, E. R., Guti´errez-Sacrist´an, A., Das, P., . . . other (2023). Long-term kidney function recovery and mortality after covid-19-associated acute kidney injury: an international multi-centre observational cohort study. EClinicalMedicine, 55,
  30. Guti´errez-Sacrist´an, A., Serret-Larmande, A., Hutch, M. R., S´aez, C., Aronow, B. J., Bhatnagar, S., . . . other (2022). Hospitalizations associated with mental health conditions among 3 adolescents in the us and france during the covid-19 pandemic. JAMA Network Open, 5(12), e2246548-e2246548.
  31. Han, L., Wang X., & Cai, T (2022). Identifying surrogate markers in real-world comparative effectiveness research. Statistics in Medicine, 41(26), 5290-5304.
  32. Fan, C., Song, Y., Wang X., Mao, C., & Xiong, Y (2022). Identification of early derangements of coagulation, hematological and biochemical profiles in patients with acute pancreatitis. Clinical Biochemistry, 109, 37-43.
  33. Zhou, Q. M., Wang X., Zheng, Y., & Cai, T (2022). New weighting methods when cases are only a subset of events in a nested case-control study. Biometrical Journal, 67(7), 1240-1259.
  34. Weber, G. M., Hong, C., Xia, Z., Palmer, N. P., Avillach, P., L'yi, S., . . . other (2022). International comparisons of laboratory values from the 4ce collaborative to predict covid-19 mortality. NPJ digital medicine, 5(1), 74.
  35. Hong, C., Zhang, H. G., L'Yi, S.,Weber, G., Avillach, P., Tan, B. W., . . . other (2022). Changes in laboratory value improvement and mortality rates over the course of the pandemic: an international retrospective cohort study of hospitalised patients infected with sars-cov-2. BMJ open, 12(6), e057725.
  36. Zhang, J., Wang, Q., & Wang X (2022). Surrogate-variable-based model-free feature screening for survival data under the general censoring mechanism. Annals of the Institute of Statistical Mathematics, 74(2), 379-397.
  37. Wang X., Cai, T., Tian, L., Bourgeois, F., & Parast, L (2021). Quantifying the feasibility of shortening clinical trial duration using surrogate markers. Statistics in medicine, 40(28), 6321-6343.
  38. Estiri, H., Strasser, Z. H., Brat, G. A., Semenov, Y. R., Patel, C. J., & Murphy, S. N (2021). Evolving phenotypes of non-hospitalized patients that indicate long covid. BMC medicine, 19, 1-10.
  39. Weber, G. M., Zhang, H. G., L'Yi, S., Bonzel, C.-L., Hong, C., Avillach, P., . . . other (2021). Authorship correction: international changes in covid-19 clinical trajectories across 315 hospitals and 6 countries: retrospective cohort study. Journal of Medical Internet Research, 22(11), e34625.
  40. Weber, G. M., Zhang, H. G., L'Yi, S., Bonzel, C.-L., Hong, C., Avillach, P., . . . other (2021). International changes in covid-19 clinical trajectories across 315 hospitals and 6 countries: retrospective cohort study. Journal of medical Internet research, 23(10), e31400.
  41. Chepelev, L. L., Wang X., Gold, B., Bonzel, C.-L., Rybicki Jr, F., Uyeda, J. W., . . . other (2021). Improved appropriateness of advanced diagnostic imaging after implementation of clinical decision support mechanism. Journal of Digital Imaging, 34, 397-403.
  42. Tong, J., Huang, J., Chubak, J., Wang X., Moore, J. H., Hubbard, R. A., & Chen, Y (2020). An augmented estimation procedure for ehr-based association studies accounting for differential misclassification. Journal of the American Medical Informatics Association, 27(2), 244-253.
  43. Wang X., & Zhou, X.-H (2018). Semiparametric maximum likelihood estimation for the cox model with length-biased survival data. Journal of statistical planning and inference, 196, 163-173.
  44. Maier, M. M., Zhou, X.-H., Chapko, M., Leipertz, S. L., Wang X., & Beste, L. A (2018). Hepatitis c cure is associated with decreased healthcare costs in cirrhotics in retrospective veterans affairs cohort. Digestive diseases and sciences, 63, 1454-1462.
  45. Wang, Q., & Wang X (2018). Analysis of censored data under heteroscedastic transformation regression models with unknown transformation function. Canadian Journal of Statistics, 46(2), 233-245.
  46. Wang X., Beste, L. A., Maier, M. M., & Zhou, X.-H (2016). Double robust estimator of average causal treatment effect for censored medical cost data. Statistics in medicine, 35(18), 3101-3116.
  47. Wang X., Wang, Q., & Zhou, X.-H. A (2015). Partially varying coefficient single-index additive hazard models. Annals of the Institute of Statistical Mathematics, 67, 817-841.
  48. Wang X., & Wang, Q (2015). Semiparametric linear transformation model with differential measurement error and validation sampling. Journal of Multivariate Analysis, 141, 67-80.
  49. Hutch, M. R., Son, J., Le, T. T., Hong, C., Wang X., Abad, Z. S. H., . . . other (2025). Correction: Neurological diagnoses in hospitalized covid-19 patients associated with adverse outcomes: A multinational cohort study. PLOS Digital Health, 4(7), e000957.

Letter

  1. Wang X, Ludmir EB, Wei L (2025). Assessing Clinical Utility of Datopotamab Deruxtecan Versus Chemotherapy for Breast Cancer. Journal of clinical oncology, 43(18), 2136-2137.

Other

  1. Cheng, D., Wang X., McDermott, G. C., Hanberg, J. S., Love, Z., Zhong, K., . . . other (2025). Preprint: Inferring rheumatoid arthritis disease activity status from the electronic health records across health systems to enable real-world data studies.