Publications

Additional information may be found at Google Scholar, dblp, and Semantic Scholar.

  • On the Sample Complexity of Robust Binary Hypothesis Testing Shankar Vallinayagam, Ankit Pensia, and Varun Jog.
    Manuscript, 2026
    arXiv
  • Robust Regression with Adaptive Contamination in Response:
    Optimal Rates and Computational Barriers
    with Ilias Diakonikolas, Chao Gao, Daniel M. Kane, and Dong Xie.
    Manuscript, 2026
    arXiv
  • High-dimensional estimation with missing data:
    Statistical and computational limits
    with Kabir A. Verchand, Saminul Haque, and Rohith Kuditipudi.
    Manuscript, 2026
    arXiv
  • Information-Computation Tradeoffs for Noiseless Linear Regression with Oblivious Contamination with Ilias Diakonikolas, Chao Gao, Daniel M. Kane, and John Lafferty.
    Advances in Neural Information Processing Systems (NeurIPS), 2025
    arXivconference version
  • SoS Certificates for Sparse Singular Values and Their Applications:
    Robust Statistics, Subspace Distortion, and More
    with Ilias Diakonikolas, Samuel B. Hopkins, and Stefan Tiegel.
    Symposium on Theory of Computing (STOC), 2025
    arXivconference version
  • SoS Certifiability of Subgaussian Distributions and its Algorithmic Applications with Ilias Diakonikolas, Samuel B. Hopkins, and Stefan Tiegel.
    Symposium on Theory of Computing (STOC), 2025
    arXivconference versionslides (10min)slides (25min)
  • The Sample Complexity of Simple Binary Hypothesis Testing:
    Tight Bounds with Sequential Interactivity and Information Constraints
    Hadi Kazemi, Ankit Pensia, and Varun Jog.
    Conference on Learning Theory (COLT), 2025
    arXivconference version
  • Optimal Robust Estimation under Local and Global Corruptions:
    Stronger Adversary and Smaller Error
    with Thanasis Pittas.
    Conference on Learning Theory (COLT), 2025
    arXivconference version
  • A Sub-Quadratic Time Algorithm for Robust Sparse Mean Estimation Ankit Pensia.
    International Conference on Machine Learning (ICML), 2024 (Spotlight)
    arXivconference versionslides (of a survey; 1hr)
  • The Sample Complexity of Simple Binary Hypothesis Testing with Varun Jog and Po-Ling Loh.
    Conference on Learning Theory (COLT), 2024
    arXivconference versionslides (10min)
  • Simple Binary Hypothesis Testing under Local Differential Privacy and Communication Constraints with Amir R. Asadi, Varun Jog, and Po-Ling Loh.
    IEEE Transactions on Information Theory (Trans. Inf. Theory), 2024
    An extended abstract appeared at Conference on Learning Theory (COLT), 2023

    arXivjournal versionslides (20min)slides (1hr)Code
  • Black-Box $k$-to-1-PCA Reductions: Theory and Applications with Arun Jambulapati, Syamantak Kumar, Jerry Li, Shourya Pandey, and Kevin Tian.
    Conference on Learning Theory (COLT), 2024
    arXivconference versionslides (10min)
  • Robust Sparse Estimation for Gaussians with Optimal Error under Huber Contamination with Ilias Diakonikolas, Daniel M. Kane, Sushrut Karmalkar, and Thanasis Pittas.
    International Conference on Machine Learning (ICML), 2024
    arXivconference version
  • Semi-supervised Group DRO: Combating Sparsity with Unlabeled Data with Pranjal Awasthi and Satyen Kale.
    International Conference on Algorithmic Learning Theory (ALT), 2024
    conference version
  • Robust regression with covariate filtering: Heavy tails and adversarial contamination with Varun Jog and Po-Ling Loh.
    Journal of the American Statistical Association (JASA), 2024
    arXivjournal versionCode
  • Communication-constrained hypothesis testing:
    Optimality, robustness, and reverse data processing inequalities
    with Varun Jog and Po-Ling Loh.
    IEEE Transactions on Information Theory (Trans. Inf. Theory), 2024
    A shorter version appeared at ISIT 2022

    arXivjournal version
  • Near-Optimal Algorithms for Gaussians with Huber Contamination:
    Mean Estimation and Linear Regression
    with Ilias Diakonikolas, Daniel M. Kane, and Thanasis Pittas.
    Advances in Neural Information Processing Systems (NeurIPS), 2023
    arXivconference version
  • A Spectral Algorithm for List-Decodable Covariance Estimation in Relative Frobenius Norm with Ilias Diakonikolas, Daniel M. Kane, Jasper C.H. Lee, and Thanasis Pittas.
    Advances in Neural Information Processing Systems (NeurIPS), 2023 (Spotlight)
    arXivconference version
  • Nearly-Linear Time and Streaming Algorithms for Outlier-Robust PCA with Ilias Diakonikolas, Daniel M. Kane, and Thanasis Pittas.
    International Conference on Machine Learning (ICML), 2023
    arXivconference version
  • Gaussian Mean Testing Made Simple with Ilias Diakonikolas and Daniel M. Kane.
    SIAM Symposium on Simplicity in Algorithms (SOSA), 2023
    arXivconference version
  • Outlier-Robust Sparse Mean Estimation for Heavy-Tailed Distributions with Ilias Diakonikolas, Daniel M. Kane, and Jasper C.H. Lee.
    Advances in Neural Information Processing Systems (NeurIPS), 2022
    arXivconference version
  • List-Decodable Sparse Mean Estimation via Difference-of-Pairs Filtering with Ilias Diakonikolas, Daniel M. Kane, Sushrut Karmalkar, and Thanasis Pittas.
    Advances in Neural Information Processing Systems (NeurIPS), 2022 (Oral)
    arXivconference version
  • Robust Sparse Mean Estimation via Sum of Squares with Ilias Diakonikolas, Daniel M. Kane, Sushrut Karmalkar, and Thanasis Pittas.
    Conference on Learning Theory (COLT), 2022
    arXivconference version
  • Streaming Algorithms for High-Dimensional Robust Statistics with Ilias Diakonikolas, Daniel M. Kane, and Thanasis Pittas.
    International Conference on Machine Learning (ICML), 2022
    arXivconference version
  • Sharp Concentration Inequalities for the Centered Relative Entropy with Alankrita Bhatt.
    Information and Inference: a Journal of the IMA, 2022
    arXivjournal version
  • Statistical Query Lower Bounds for List-Decodable Linear Regression with Ilias Diakonikolas, Daniel M. Kane, Thanasis Pittas, and Alistair Stewart.
    Advances in Neural Information Processing Systems (NeurIPS), 2021 (Spotlight)
    arXivconference version
  • Estimating location parameters in sample-heterogeneous distributions with Varun Jog and Po-Ling Loh.
    Information and Inference: a Journal of the IMA, 2021
    A shorter version of this article appeared at ISIT 2019

    arXivjournal versionPDF
  • Outlier Robust Mean Estimation with Subgaussian Rates via Stability with Ilias Diakonikolas and Daniel M. Kane.
    Advances in Neural Information Processing Systems (NeurIPS), 2020
    arXivconference version
  • Optimal Lottery Tickets via SubsetSum: Logarithmic Over-Parameterization is Sufficient with Shashank Rajput, Alliot Nagle, Harit Vishwakarma, and Dimitris Papailiopoulos.
    Advances in Neural Information Processing Systems (NeurIPS), 2020 (Spotlight)
    arXivconference version
  • Extracting robust and accurate features via a robust information bottleneck with Varun Jog and Po-Ling Loh.
    IEEE Journal on Selected Areas in Information Theory (JSAIT), 2020
    journal versionPDF
  • Deep Topic Models for Multi-label Learning with Rajat Panda, Nikhil Mehta, Mingyuan Zhou, and Piyush Rai.
    International Conference on Artificial Intelligence and Statistics (AISTATS), 2019
    conference version
  • Generalization Error Bounds for Noisy, Iterative Algorithms with Varun Jog and Po-Ling Loh.
    IEEE International Symposium on Information Theory (ISIT), 2018
    arXivconference version