publications
2025
- COLINGQuickLLaMA: Query-aware Inference Acceleration for Large Language ModelsIn International Conference on Computational Linguistics, 2025
2024
- arXivSepLLM: Accelerate Large Language Models by Compressing One Segment into One TokenPreprint arXiv:2412.12094, 2024
- NeurIPSDAPE: Data-Adaptive Positional Encoding for Length ExtrapolationIn Neural Information Processing Systems, 2024
- NeurIPSDiffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language ModelsIn Neural Information Processing Systems, 2024
- ICLR
Spotlight MetaMath: Bootstrap Your Own Mathematical Questions for Large Language ModelsIn International Conference on Learning Representations, 2024 - ICLR
Oral LEGO-Prover: Neural Theorem Proving with Growing LibrariesIn International Conference on Learning Representations, 2024 - Findings of ACLForward-Backward Reasoning in Large Language Models for Mathematical VerificationIn Findings of the Association for Computational Linguistics, 2024
- arXivAccelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi DecodingPreprint arXiv:2410.01699, 2024
- arXivEfficient Multi-modal Large Language Models via Visual Token GroupingPreprint arXiv:2411.17773, 2024
- arXivDAPE V2: Process Attention Score as Feature Map for Length Extrapolationpreprint arXiv:2410.04798, 2024
- arXivDiM: Diffusion Mamba for Efficient High-Resolution Image SynthesisPreprint arXiv:2405.14224, 2024
- arXivOn the Expressive Power of a Variant of the Looped TransformerPreprint arXiv:2402.13572, 2024
2023
- ACLDT-Solver: Automated Theorem Proving with Dynamic-Tree Sampling Guided by Proof-level Value FunctionIn Association for Computational Linguistics, 2023
- ICCV
Oral DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-TuningIn International Conference on Computer Vision, 2023 - ICCVGrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-trainingIn International Conference on Computer Vision, 2023
2022
- CVPRContinual Object Detection via Prototypical Task Correlation Guided Gating MechanismIn Computer Vision and Pattern Recognition, 2022
- ICLRRevisiting Over-smoothing in BERT from the Perspective of GraphIn International Conference on Learning Representations, 2022
- AAAIAutoBERT-Zero: Evolving BERT Backbone from ScratchIn AAAI Conference on Artificial Intelligence, 2022
2021
- ICMLSparseBERT: Rethinking the Importance Analysis in Self-attentionIn International Conference on Machine Learning, 2021
2020
- NeurIPSBridging the Gap between Sample-based and One-shot Neural Architecture Search with BONASNeural Information Processing Systems, 2020
- AAAIEffective Decoding in Graph Auto-Encoder using Triadic ClosureIn AAAI Conference on Artificial Intelligence, 2020