Jonas Mueller
jonasmueller
á
csail.mit.edu
Chief Scientist @
Cleanlab
Hiring interns and
senior
software or ML engineers! Please email me.
About Me
:
I am Chief Scientist and Co-Founder at
Cleanlab
, building
tools
for
data-centric AI
. Previously, I worked on new ML algorithms at
Amazon Web Services
and
Microsoft Research
. Before that, I completed my Ph.D. at the
MIT Computer Science & Artificial Intelligence Lab
.
Publications
[Full list at
Google Scholar
]
Flexible Model Aggregation for Quantile Regression
R Fakoor, T Kim, J Mueller, A Smola, R Tibshirani
Journal of Machine Learning Research (JMLR)
, 2023
Adaptive Interest for Emphatic Reinforcement Learning
M Klissarov, R Fakoor, J Mueller, K Asadi, T Kim, A Smola
Advances in Neural Information Processing Systems (NeurIPS)
, 2022
[Featured as
Spotlight Presentation
at ICML DARL Workshop]
Does your graph need a confidence boost? Convergent boosted smoothing on graphs with tabular node features
J Chen, J Mueller, V Ioannidis, S Adeshina, Y Wang, T Goldstein, D Wipf
International Conference on Learning Representations (ICLR)
, 2022
[Featured as
Spotlight Presentation
]
Deep Learning for the Partially Linear Cox Model
Q Zhong, J Mueller, J Wang
Annals of Statistics
, 2022
Detecting Label Errors in Token Classification Data
WC Wang, J Mueller
NeurIPS 2022 Workshop on Interactive Learning for Natural Language Processing (InterNLP)
, 2022
[
Code to run method
] [
Code to reproduce results
] [
Blog Post
]
CROWDLAB: Supervised learning to infer consensus labels and quality scores for data with multiple annotators
HW Goh, U Tkachenko, J Mueller
NeurIPS 2022 Human in the Loop Learning Workshop
, 2022
[
Code to run method
] [
Code to reproduce results
] [
Blog Post
]
Model-Agnostic Label Quality Scoring to Detect Real-World Label Errors
J Kuan, J Mueller
ICML DataPerf Workshop
, 2022
[
Code
]
Back to the Basics: Revisiting Out-of-Distribution Detection Baselines
J Kuan, J Mueller
ICML Workshop on Principles of Distribution Shift
, 2022
[
Code
]
ResNeSt: Split-Attention Networks
H Zhang, C Wu, Z Zhang, Y Zhu, Z Zhang, H Lin, Y Sun, T He, J Mueller, R Manmatha, M Li, A Smola
IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
, 2022
[Featured in
Synced
,
Analytics India Magazine
,
Pytorch Image Models
] [
Code
]
Towards Automated Distillation: A Systematic Study of Knowledge Distillation in Natural Language Processing
H He, X Shi, J Mueller, S Zha, M Li, G Karypis
International Conference on Automated Machine Learning: Late-Breaking Workshop
, 2022
Pervasive Label Errors in Test Sets Destabilize Machine Learning Benchmarks
C Northcutt, A Athalye, J Mueller
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks
, 2021
[Featured as Oral Presentation and in
Wired
,
VentureBeat
,
MIT Technology Review
,
CNN News18
] [
Webpage
] [
Code
] [
Blog
]
Benchmarking Multimodal AutoML for Tabular Data with Text Fields
X Shi
*
, J Mueller
*
, N Erickson, M Li, A Smola
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks
, 2021
[Featured as
Contributed Talk
at
ICML Workshop on Automated Machine Learning
] [
Code
] [
Benchmark Datasets
]
Overinterpretation reveals image classification model pathologies
B Carter, S Jain, J Mueller, D Gifford
Advances in Neural Information Processing Systems (NeurIPS)
, 2021
[Featured in
TechXplore
,
VentureBeat
,
World Economic Forum
,
NY Tech Media
] [
Lecture Notes
] [
Code
]
Deep Extended Hazard Models for Survival Analysis
Q Zhong, J Mueller, J Wang
Advances in Neural Information Processing Systems (NeurIPS)
, 2021
Continuous Doubly Constrained Batch Reinforcement Learning
R Fakoor, J Mueller, K Asadi, P Chaudhari, A Smola
Advances in Neural Information Processing Systems (NeurIPS)
, 2021
[
Code
]
Robust Reinforcement Learning for Shifting Dynamics During Deployment
S Stanton, R Fakoor, J Mueller, A Wilson, A Smola
NeurIPS Workshop on Safe and Robust Control of Uncertain Systems
, 2021
Deep Learning for Functional Data Analysis with Adaptive Basis Layers
J Yao, J Mueller, J Wang
International Conference on Machine Learning (ICML)
, 2021
[
Code
]
Fast, Accurate, and Simple Models for Tabular Data via Augmented Distillation
R Fakoor
*
, J Mueller
*
, P Chaudhari, A Smola
Advances in Neural Information Processing Systems (NeurIPS)
, 2020
[
Lecture Notes
] [
Code
]
AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data
N Erickson
*
, J Mueller
*
, A Shirkov, H Zhang, P Larroy, M Li, A Smola
ICML Workshop on Automated Machine Learning
, 2020
[
Code
] [
Tutorial
] [Featured in
VentureBeat
,
InfoWorld
,
SiliconAngle
,
Synced
,
SD Times
,
Forbes
]
TraDE: Transformers for Density Estimation
R Fakoor, P Chaudhari, J Mueller, A Smola
ICML Workshop on Invertible Neural Networks, Normalizing Flows, and Explicit Likelihood Models
, 2020
Educating Text Autoencoders: Latent Representation Guidance via Denoising
T Shen, J Mueller, R Barzilay, T Jaakkola
International Conference on Machine Learning (ICML)
, 2020
[
Slides
] [
Code
]
Maximizing Overall Diversity for Improved Uncertainty Estimates in Deep Ensembles
S Jain
*
, G Liu
*
, J Mueller, D Gifford
Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI)
, 2020
Antibody Complementarity Determining Region Design Using High-Capacity Machine Learning
G Liu
*
, H Zeng
*
, J Mueller, B Carter, Z Wang, J Schilz, G Horny, M Birnbaum, S Ewert, D Gifford
Bioinformatics
, 2020
Low-Rank Bandit Methods for High-Dimensional Dynamic Pricing
J Mueller, V Syrgkanis, M Taddy
Advances in Neural Information Processing Systems (NeurIPS)
, 2019
[
Code
]
Recognizing Variables from their Data via Deep Embeddings of Distributions
J Mueller, A Smola
IEEE International Conference on Data Mining (ICDM)
, 2019
[
Slides
]
IMaT: Unsupervised Text Attribute Transfer via Iterative Matching and Translation
Z Jin
*
, D Jin
*
, J Mueller, N Matthews, E Santus
Empirical Methods in Natural Language Processing (EMNLP)
, 2019
[
Data
]
What made you do this? Understanding black-box decisions with sufficient input subsets
B Carter
*
, J Mueller
*
, S Jain, D Gifford
Artificial Intelligence and Statistics (AISTATS)
, 2019
[Featured as Contributed Talk at
NIPS Workshop on Interpretability and Robustness
] [
Slides
] [
Lecture Notes
] [
Code
]
A peninsular structure coordinates asynchronous differentiation with morphogenesis to generate pancreatic islets
N Sharon
*
, R Chawla
*
, J Mueller, J Vanderhooft, J Whitehorn, B Rosenthal, M Gurtler, R Estanboulieh, D Shvartsman, D Gifford, C Trapnell, D Melton
Cell
, 2019
[Featured as a
Research Highlight
in
Nature Reviews Endocrinology
] [
Journal Version
] [
Data
] [
Video
]
Wnt Signaling Separates the Progenitor and Endocrine Compartments during Pancreas Development
N Sharon, J Vanderhooft, J Straubhaar, J Mueller, R Chawla, Q Zhou, E Engquist, C Trapnell, D Gifford, D Melton
Cell Reports
, 2019
[Part of research effort that led to the
first cure for Type 1 Diabetes
] [
Data
]
Modeling Persistent Trends in Distributions
J Mueller, T Jaakkola, D Gifford
Journal of the American Statistical Association (JASA)
, 2018
[
Journal Version
] [
Code
]
Sequence to Better Sequence: Continuous Revision of Combinatorial Structures
J Mueller, D Gifford, T Jaakkola
International Conference on Machine Learning (ICML)
, 2017
[
Slides
] [
Code
]
Learning Optimal Interventions
J Mueller, D Reshef, G Du, T Jaakkola
Artificial Intelligence and Statistics (AISTATS)
, 2017
[Featured as Contributed Talk at
NIPS Workshop on Bayesian Optimization
] [
Slides
] [
Code
]
Siamese Recurrent Architectures for Learning Sentence Similarity
J Mueller, A Thyagarajan
Proceedings of the 30th AAAI Conference on Artificial Intelligence (AAAI)
, 2016
[
Lecture Notes
] [
Code
]
Principal Differences Analysis: Interpretable Characterization of Differences between Distributions
J Mueller and T Jaakkola
Advances in Neural Information Processing Systems (NIPS)
, 2015
[
Slides
] [
Code
]
General Triallelic Frequency Spectrum Under Demographic Models with Variable Population Size
P Jenkins, J Mueller, Y Song
Genetics
, 2014
[Featured as a
Research Highlight
in
Nature Reviews Genetics
]
Flexible models for understanding and optimizing complex populations
J Mueller
Ph.D. Thesis, MIT Department of Electrical Engineering and Computer Science
, 2018
Modeling temporally-regulated effects on distributions
J Mueller
Masters Thesis, MIT Department of Electrical Engineering and Computer Science
, 2015
Statistical inference of recombination-inducing genic features in Drosophila melanogaster
J Mueller
Honors Thesis, UC Berkeley Department of Statistics
, 2013
*
Equal Contribution