Main || CV || Publications || Software || Visuals and Animations

Curriculum Vitae

Andrey A. Shabalin, Ph.D.

Research Assistant Professor
Department of Psychiatry
University of Utah

383 Colorow Dr, Office 339
Salt Lake City, UT 84108

Phone: (919) 923-8325
Google Scholar:
NCBI Bibliography:


2005 – 2010, Ph.D. in Statistics, Department of Statistics and Operations Research,
University of North Carolina at Chapel Hill. Professor Andrew B. Nobel, adviser
2002 – 2004, M.A. in Economics, New Economic School, Moscow, Russia
1997 – 2002, M.S. in Mathematics (with honors), Moscow State University, Moscow, Russia.


2017 – now, Research Assistant Professor, Department of Psychiatry, University of Utah
2014 – 2017, Assistant Professor, Center for Biomarker Research and Precision Medicine, VCU
2012 – 2014, Postdoctoral Research Associate, VCU, Mentor Edwin van den Oord
2010 – 2012, Postdoctoral Research Associate, UNC, Mentor Fred A. Wright
2004 – 2005, Economist, Centre for Economic and Financial Research, Moscow, Russia
2004 – 2005, Research Assistant, New Economic School, Moscow, Russia


High density methylation QTL analysis in human blood via next-generation sequencing of the methylated genomic DNA fraction (2015)
J.L. McClay*, A.A. Shabalin*(co-first authors), ..., K.A. Aberg, and E.J.C.G. van den Oord
Genome Biology, 16:291
DOI: 10.1186/s13059-015-0842-7, PMID: 26699738, PMCID: PMC4699364

Candidate gene methylation studies are at high risk of erroneous conclusions (2015)
A.A. Shabalin, K.A. Aberg, and E.J.C.G. van den Oord
Epigenomics, 7(1), 13-5
DOI: 10.2217/epi.14.70, PMID: 25687462

An integrated map of structural variation in 2,504 human genomes (2015)
P.H. Sudmant, ..., A.A. Shabalin (co-author 62 of 83)
Nature, 526, 75–81
DOI: 10.1038/nature15394, PMID: 26432246, PMCID: PMC4617611

The Genotype-Tissue Expression (GTEx) pilot analysis: multi-tissue gene regulation in humans (2015)
K.G. Ardlie, ..., A.A. Shabalin (co-author 22 or 139)
Science 348 (6235), 648-660
DOI: 10.1126/science.1262110, PMID: 25954001, PMCID: PMC4547484

A Whole Methylome CpG-SNP Association Study of Psychosis in Blood and Brain Tissue (2015)
E.J.C.G. van den Oord, S.L. Clark, L.Y. Xie, A.A. Shabalin, M.G. Dozmorov, G. Kumar, Swedish Schizophrenia Consortium, V.I. Vladimirov, P.K.E. Magnusson, and K.A. Aberg
Schizophrenia Bulletin, 182
DOI: 10.1093/schbul/sbv182, PMID: 26656881, PMCID: PMC4903046

Deep Sequencing of Three Loci Implicated in Large-Scale Genome-Wide Association Study Smoking Meta-Analyses (2015)
S.L. Clark, J.L. McClay, D.E. Adkins, K.A. Aberg, G. Kumar, S. Nerella, L. Xie, A.L. Collins, J.J. Crowley, C.R. Quakenbush, C.E. Hillard, G. Gao, A.A. Shabalin, R.E. Peterson, W.E. Copeland, J.L. Silberg, H. Maes, P.F. Sullivan, E.J. Costello, and E.J.C.G. van den Oord
Nicotine & Tobacco Research, 166
DOI: 10.1093/ntr/ntv166

Combined whole methylome and genomewide association study implicates CNTN4 in alcohol use (2015)
S.L. Clark, K.A. Aberg, S. Nerella, G. Kumar, J.L. McClay, W. Chen, L.Y. Xie, A. Harada, A.A. Shabalin, G. Gao, S.E. Bergen, C.M. Hultman, P.K.E. Magnusson, P.F. Sullivan, and E.J.C.G. van den Oord
Alcoholism: Clinical and Experimental Research, 39 (8), 1396-1405
DOI: 10.1111/acer.12790, PMID: 26146898, PMCID: PMC4515164

Refinement of schizophrenia GWAS loci using methylome-wide association data (2015)
G. Kumar, S.L. Clark, J.L. McClay, A.A. Shabalin, D.E. Adkins, L. Xie, R. Chan, S. Nerella, Y. Kim, P.F. Sullivan, C.M. Hultman, P.K.E. Magnusson, K.A. Aberg, and E.J.C.G. van den Oord
Human Genetics, 134 (1), 77-87
DOI: 10.1007/s00439-014-1494-5, PMID: 25284466, PMCID: PMC4282961

Quantitative trait locus mapping methods for diversity outbred mice (2014)
D.M. Gatti, K.L. Svenson, A.A. Shabalin, L.Y. Wu, W. Valdar, P. Simecek, N. Goodwin, R. Cheng, D. Pomp, A. Palmer, E.J. Chesler, K.W. Broman, G.A. Churchill
G3: Genes Genomes Genetics, 2014 vol. 4 no. 9, 1623-1633
DOI: 10.1534/g3.114.013748, PMID: 25237114, PMCID: PMC4169154

The Genotype-Tissue Expression (GTEx) project (2013)
J. Lonsdale, J. Thomas, ..., A.A. Shabalin (co-author 98 of 127)
Nature Genetics, 45 (6), 580-585
DOI: 10.1038/ng.2653, PMID: 23715323, PMCID: PMC4010069

Heritability and genomics of gene expression in peripheral blood (2014)
F.A. Wright, ..., A.A. Shabalin (co-author 25 of 39)
Nature Genetics, 46 (5), 430-437
DOI: 10.1038/ng.2951, PMID: 24728292, PMCID: PMC4012342

Resolving the polymorphism-in-probe problem is critical for correct interpretation of expression QTL studies (2013)
A. Ramasamy, D. Trabzuni, J.R. Gibbs, A. Dillman, D.G. Hernandez, S. Arepalli, R. Walker, C. Smith, G.P. Ilori, A.A. Shabalin, Y. Li, A.B. Singleton, M.R. Cookson, J. Hardy, M. Ryten, M.E. Weale
Nucleic Acids Research, 41 (7), e88-e88
DOI: 10.1093/nar/gkt069, PMID: 23435227, PMCID: PMC3627570

Reconstruction of a low-rank matrix in the presence of Gaussian noise (2013)
A.A. Shabalin and A.B. Nobel
Journal of Multivariate Analysis, 118, 67-76
DOI: 10.1016/j.jmva.2013.03.005

Matrix eQTL: Ultra fast eQTL analysis via large matrix operations (2012)
A.A. Shabalin
Bioinformatics, 28 (10): 1353-1358
DOI: 10.1093/bioinformatics/bts163, PMID: 22492648, PMCID: PMC3348564

seeQTL: A searchable database for human eQTLs (2012)
K. Xia, A.A. Shabalin, ..., and F.A. Wright
Bioinformatics, 28 (3): 451-452
DOI: 10.1093/bioinformatics/btr678, PMID: 22171328, PMCID: PMC3268245

Computational tools for discovery and interpretation of Expression Quantitative Trait Loci (eQTL) (2012)
F.A. Wright, A.A. Shabalin, and I. Rusyn
Pharmacogenomics, 13 (3), 343-352
DOI: 10.2217/pgs.11.185, PMID: 22048815, PMCID: PMC3295835

Basal-like Breast Cancer DNA copy number losses identify genes involved in genomic instability, response to therapy, and patient survival (2012)
V.J. Weigman, H.H. Chao, A.A. Shabalin, ..., and C.M. Perou
Breast Cancer Research and Treatment, 133 (3), 865-880
DOI: 10.1007/s10549-011-1846-y, PMCID: PMC3387500

Sex-specific Gene Expression in BXD Mouse Liver (2010)
D. Gatti, N. Zhao, E. Chesler, B. Bradford, A.A. Shabalin, R. Yordanova, L. Lu, and I. Rusyn
Physiological Genomics, 42 (3), 456-468
DOI: 10.1152/physiolgenomics.00110.2009, PMID: 20551147, PMCID: PMC2929887

Finding large average submatrices in high dimensional data (2009)
A.A. Shabalin, V.J.Weigman, C.M. Perou, and A.B. Nobel
Annals of Applied Statistics, 3(3), 985-1012, 2009
DOI: 10.1214/09-AOAS239

FastMap: Fast eQTL mapping in homozygous populations (2009)
D.M. Gatti*, A.A. Shabalin*(co-first authors), T.C. Lam, F.A. Wright, I. Rusyn, and A.B. Nobel
Bioinformatics, 25(4):482, 2009
DOI: 10.1093/bioinformatics/btn648, PMCID: PMC2642639

The Set2/Rpd3S pathway suppresses cryptic transcription without regard to gene length or transcription frequency (2009)
C. Lickwar, B. Rao, A.A. Shabalin, A.B. Nobel, B.D. Strahl, and J.D. Lieb
PLoS ONE, 4(3), e4886, 2009
DOI: 10.1371/journal.pone.0004886, PMID: 19295910, PMCID: PMC2654109

Merging two gene-expression studies via cross-platform normalization (2008)
A.A. Shabalin, H. Tjelmeland, C. Fan, C.M. Perou, and A.B. Nobel
Bioinformatics, 24(9):1154, 2008
DOI: 10.1093/bioinformatics/btn083

Detection of Low Rank Signals in Noise and Fast Correlation Mining with Applications to Large Biological Data (2010)
A.A. Shabalin
Ph.D. Dissertation. UNC-CH, Department of Statistics and Operations Research

Submitted and working papers:

Estimation of Interpretable eQTL Effect Sizes Using a Log of Linear Model
J. Palowitch, A.A. Shabalin, Y. Zhou, A.B. Nobel, and F.A. Wright

Local genetic effects on gene expression across 44 human tissues
F. Aguet, ..., A.A. Shabalin (co-author 24 of 50)

A non-parametric statistical framework for integrating heterogeneous prior information in large-scale multiple testing
A.A. Shabalin, J. Bukzar, J.L. McClay, K.A. Aberg, and E.J.C.G. van den Oord

An empirical bayes approach for Multiple Tissue eQTL Analysis
G. Li, A.A. Shabalin, I. Rusyn, F.A. Wright, and A.B. Nobel

FastMap 2.0: Fast Association Mapping in Heterozygous Populations
D.M. Gatti, A.A. Shabalin, M. Sypa, T.C. Lam, F.A. Wright, A.B. Nobel, and I. Rusyn

Programming languages:

R (creating packages for CRAN and Bioconductor, vectorization, see Matrix eQTL, RaMWAS)
Matlab (multithreaded and GPU programming, vectorization, see LAS, XPN)
Java, C# (multithreaded programming, complex data structures, see LAS, FastMap)
Gauss, SPSS, Stata

Software developed:

Matrix eQTL. Ultra fast eQTL analysis via large matrix operations
 R at CRAN,

RaMWAS. Fast Methylome-Wide Association Study Pipeline for Enrichment Platforms
 R (with C/C++) at Bioconductor,

Filematrix. File-Backed Matrix Class with Convenient Read and Write Access
 R at CRAN,

fastCircularPermutations. Very fast circular permutation analysis on binary outcomes
 R (with C/C++) at GitHub,

ACMEeqtl. Estimation of Interpretable eQTL Effect Sizes Using a Log of Linear Model
 R (with C/C++) at GitHub,

CorrMeta: Fast Association Analysis for eQTL with Related Samples
 R at

DOQTL. Genotyping and QTL Mapping in Diversity Outbred Mice
 R at Bioconductor,

LAS Biclustering. Finding large average submatrices in high-dimensional data.
 C# (with GUI) and Matlab,

FastMap. Fast gene expression quantitative trait loci mapping tool.
 Java (with GUI),

XPN. Cross-platform normalization method for combining gene expression data

SWITCHdna. SupWald identification of DNA copy changes

Teaching Experience:

Introduction to Statistics, Instructor, UNC-CH, Spring – Fall 2009
Responsibilities: conduct lectures, design homework and exams, grade exams.

Econometrics and Time-series analysis, Instructor,
State University of Humanities, Moscow, Russia, Fall 2003 – Spring 2004
Responsibilities: complete charge of the course – choose textbook, conduct lectures, etc.

Applied Time Series Econometrics, Econometrics IV, and Continuous Time Finance,
Instructor Assistant, New Economic School, Moscow, Russia, Fall 2004 – Spring 2005.
Responsibilities: conduct sessions, grade homework and exams.

Elements of Statistics, Instructor Assistant,
International College of Economics and Finance, Moscow, Russia, Fall 2003 – Spring 2004
Responsibilities: conduct sessions, grade homework and exams.

Reviewed for:

Annals of Applied Statistics
BMC Bioinformatics
BMC Genomics
Human Molecular Genetics
IEEE/ACM Transactions on Computational Biology and Bioinformatics
IEEE Transactions on Information Theory
Journal of Computational and Graphical Statistics
Journal of Multivariate Analysis
Nucleic Acids Research
Pattern Recognition
PLOS Genetics
Quantile (Moscow, Russia)
Statistics and Computing
Transactions on Computational Biology and Bioinformatics

Main || CV || Publications || Software || Visuals and Animations