2026
The Unseen Frontier: Pushing the Limits of LLM Sparsity with Surrogate-Free ADMM
ICLR 2026; JKAIA 2025 Best Paper Award
2025
An Analysis of Concept Bottleneck Models: Measuring, Understanding, and Mitigating the Impact of Noisy Annotations
NeurIPS 2025
Critical Influence of Overparameterization on Sharpness-aware Minimization
UAI 2025; ICML 2023 Workshop on High-dimensional Learning Dynamics; JKAIA 2023 Best Paper Award
SAFE: Finding Sparse and Flat Minima to Improve Pruning
ICML 2025 (Spotlight)
SASSHA: Sharpness-aware Adaptive Second-order Optimization with Stable Hessian Approximation
ICML 2025
ZIP: An Efficient Zeroth-order Prompt Tuning for Black-box Vision-Language Models
ICLR 2025
2024
Rethinking Pruning Large Language Models: Benefits and Pitfalls of Reconstruction Error Minimization
EMNLP 2024
The Role of Masking for Efficient Supervised Knowledge Distillation of Vision Transformers
ECCV 2024; ICLR 2023 Workshop
JaxPruner: A Concise Library for Sparsity Research
CPAL 2024 (Oral); ICLR 2023 Workshop on Sparsity in Neural Networks
2023
Pruning Neural Networks with Velocity-Constrained Optimization
NeurIPS 2023 Workshop on Optimization for Machine Learning
A Closer Look at the Intervention Procedure of Concept Bottleneck Models
ICML 2023; NeurIPS 2022 Workshop on Trustworthy and Socially Responsible ML
On the Effectiveness of Sharpness-Aware Minimization with Large Mini-batches
ICML 2023 Workshop on High-dimensional Learning Dynamics
FedFwd: Federated Learning without Backpropagation
ICML 2023 Workshop on Federated Learning and Analytics in Practice
Semi-Supervised Concept Bottleneck Models
ICML 2023 Workshop on Artificial Intelligence and Human-Computer Interaction
Almost Sure Last Iterate Convergence of Sharpness-Aware Minimization
ICLR 2023 Tiny Papers
2021 and earlier
Understanding the Effects of Data Parallelism and Sparsity on Neural Network Training
ICLR 2021
Optimal Mini-Batch Size for Stochastic Gradient Methods
ICLR 2021 Workshop on Science and Engineering of Deep Learning
A Signal Propagation Perspective for Pruning Neural Networks at Initialization
ICLR 2020 (Spotlight)
Synthesizing a Scene-Specific Pedestrian Detector and Pose Estimator for Static Video Surveillance
IJCV 2018
DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents
CVPR 2017 (Spotlight)
Modeling of Dynamic Environments for Visual Forecasting of American Football Plays
M.S. Thesis, Carnegie Mellon University, Dec 2015