no code implementations • 13 Mar 2024 • Ben Athiwaratkun, Sujan Kumar Gonugondla, Sanjay Krishna Gouda, Haifeng Qian, Hantian Ding, Qing Sun, Jun Wang, Jiacheng Guo, Liangfu Chen, Parminder Bhatia, Ramesh Nallapati, Sudipta Sengupta, Bing Xiang
In our study, we present bifurcated attention, a method developed for language model inference in single-context batch sampling settings.
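The snippet above only names the method, so here is a minimal NumPy sketch of the core idea as we understand it: when every sample in the batch shares one context, the attention for a decoding step can be split into a prefix part (keys/values stored once for the whole batch) and an incremental part (per-sample keys/values for tokens decoded so far). The function name, shapes, and single-head setup are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def bifurcated_attention(q, k_prefix, v_prefix, k_inc, v_inc):
    """Single-step attention for a batch of samples sharing one prefix.

    q:        (batch, d)       current-step query, one per sample
    k_prefix: (prefix_len, d)  keys for the shared context, stored ONCE
    v_prefix: (prefix_len, d)  values for the shared context
    k_inc:    (batch, t, d)    per-sample keys for tokens decoded so far
    v_inc:    (batch, t, d)    per-sample values for tokens decoded so far
    """
    d = q.shape[-1]
    # Scores against the shared prefix: a single GEMM reused by every sample.
    s_pre = q @ k_prefix.T / np.sqrt(d)                      # (batch, prefix_len)
    # Scores against each sample's own decoded tokens.
    s_inc = np.einsum('bd,btd->bt', q, k_inc) / np.sqrt(d)   # (batch, t)
    # Softmax over the concatenated score vector.
    s = np.concatenate([s_pre, s_inc], axis=-1)
    w = np.exp(s - s.max(axis=-1, keepdims=True))
    w /= w.sum(axis=-1, keepdims=True)
    w_pre, w_inc = w[:, :s_pre.shape[1]], w[:, s_pre.shape[1]:]
    # Weighted values: the prefix contribution shares memory across the batch.
    return w_pre @ v_prefix + np.einsum('bt,btd->bd', w_inc, v_inc)
```

The output matches ordinary attention computed with the prefix replicated per sample; the saving is that the prefix cache is never materialized `batch` times.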
no code implementations • 2 Jan 2024 • Fan Lyu, Wei Feng, Yuepan Li, Qing Sun, Fanhua Shang, Liang Wan, Liang Wang
The goal of Continual Learning (CL) is to continuously learn from new data streams and accomplish the corresponding tasks.
no code implementations • 12 Nov 2023 • Qing Sun, Shuai Niu, Minrui Fei
In this work, an innovative data-driven moving horizon state estimation method is proposed for systems with unknown dynamics, based on Bayesian optimization.
no code implementations • 12 Nov 2023 • Shuai Niu, Qing Sun, Minrui Fei, Xuqian Ju
Deriving precise system dynamic models through traditional numerical methods is often a challenging endeavor.
no code implementations • 5 Jul 2023 • Prateek Yadav, Qing Sun, Hantian Ding, Xiaopeng Li, Dejiao Zhang, Ming Tan, Xiaofei Ma, Parminder Bhatia, Ramesh Nallapati, Murali Krishna Ramanathan, Mohit Bansal, Bing Xiang
Large-scale code generation models such as Codex and CodeT5 have achieved impressive performance.
no code implementations • 9 Mar 2023 • Xiaokai Wei, Sujan Gonugondla, Wasi Ahmad, Shiqi Wang, Baishakhi Ray, Haifeng Qian, Xiaopeng Li, Varun Kumar, Zijian Wang, Yuchen Tian, Qing Sun, Ben Athiwaratkun, Mingyue Shang, Murali Krishna Ramanathan, Parminder Bhatia, Bing Xiang
Such large models incur significant resource usage (in terms of memory, latency, and dollars) as well as carbon footprint.
no code implementations • ICCV 2023 • Fan Lyu, Qing Sun, Fanhua Shang, Liang Wan, Wei Feng
In Parallel Continual Learning (PCL), the parallel multiple tasks start and end training unpredictably, thus suffering from training conflict and catastrophic forgetting issues.
2 code implementations • 26 Oct 2022 • Ben Athiwaratkun, Sanjay Krishna Gouda, Zijian Wang, Xiaopeng Li, Yuchen Tian, Ming Tan, Wasi Uddin Ahmad, Shiqi Wang, Qing Sun, Mingyue Shang, Sujan Kumar Gonugondla, Hantian Ding, Varun Kumar, Nathan Fulton, Arash Farahani, Siddhartha Jain, Robert Giaquinto, Haifeng Qian, Murali Krishna Ramanathan, Ramesh Nallapati, Baishakhi Ray, Parminder Bhatia, Sudipta Sengupta, Dan Roth, Bing Xiang
Using these benchmarks, we assess the performance of code generation models in a multi-lingual fashion and find that language models generalize to out-of-domain languages, that multi-lingual models outperform mono-lingual ones, that few-shot prompting can teach the model new languages, and that zero-shot translation abilities emerge even in mono-lingual settings.
1 code implementation • 25 Sep 2022 • Qing Sun, Fan Lyu, Fanhua Shang, Wei Feng, Liang Wan
Continual Learning (CL) sequentially learns new tasks like human beings, with the goal of achieving better Stability (S, remembering past tasks) and Plasticity (P, adapting to new tasks).
1 code implementation • 13 Apr 2022 • Griffin Adams, Han-Chin Shing, Qing Sun, Christopher Winestock, Kathleen McKeown, Noémie Elhadad
In real-world scenarios with naturally occurring datasets, reference summaries are noisy and may contain information that cannot be inferred from the source text.
no code implementations • 29 Sep 2021 • Qing Sun
Deep neural networks have achieved impressive performance on a variety of domains.
no code implementations • 29 Sep 2021 • Qing Sun, Fan Lyu, Fanhua Shang, Wei Feng, Liang Wan
Traditionally, the primary goal of Lifelong Learning (LL) is to achieve a trade-off between Stability (remembering past tasks) and Plasticity (adapting to new tasks).
no code implementations • Findings (ACL) 2021 • Qing Sun, Parminder Bhatia
Our gazetteer-based fusion model is data efficient, achieving +1.7 micro-F1 gains on the i2b2 dataset using 20% of the training data, and bringing +4.7 micro-F1 gains on novel entity mentions never seen during training.
1 code implementation • EMNLP 2020 • Kristjan Arumae, Qing Sun, Parminder Bhatia
However, in order to achieve state-of-the-art performance on out-of-domain tasks such as clinical named entity recognition and relation extraction, additional in-domain pre-training is required.
no code implementations • 23 Aug 2020 • Qing Sun, James Cross
In this paper, we provide an in-depth analysis of KL-divergence minimization in the Forward and Backward orders, which shows that in the Backward order the learner is reinforced via on-policy learning.
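The Forward/Backward distinction in the snippet above can be made concrete on discrete distributions: Forward KL(p‖q) takes the expectation under the teacher p (mass-covering), while Backward KL(q‖p) takes it under the learner q itself, i.e. the learner's own samples drive the update (on-policy). The toy distributions below are illustrative assumptions, not data from the paper.

```python
import numpy as np

def kl(p, q):
    """KL(p || q) for discrete distributions; the expectation is under p."""
    return float(np.sum(p * np.log(p / q)))

# Teacher: bimodal. Student: unimodal, covering only the first mode.
teacher = np.array([0.49, 0.02, 0.49])
student = np.array([0.90, 0.05, 0.05])

# Forward order: E_p[log p/q] -- samples come from the teacher, so the
# student's missing second mode is penalized heavily.
forward = kl(teacher, student)

# Backward order: E_q[log q/p] -- samples come from the student itself
# (on-policy), so regions the student never visits contribute nothing.
backward = kl(student, teacher)
```

Because the student ignores one teacher mode, the forward divergence is much larger than the backward one, which is the usual mass-covering versus mode-seeking contrast.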
no code implementations • 25 Sep 2019 • Qing Sun, James Cross, Dmitriy Genzel
Sequence-to-sequence models such as transformers, which are now being used in a wide variety of NLP tasks, typically need to have very high capacity in order to perform well.
no code implementations • CVPR 2017 • Qing Sun, Stefan Lee, Dhruv Batra
We develop the first approximate inference algorithm for 1-Best (and M-Best) decoding in bidirectional neural sequence models by extending Beam Search (BS) to reason about both forward and backward time dependencies.
25 code implementations • 7 Oct 2016 • Ashwin K. Vijayakumar, Michael Cogswell, Ramprasath R. Selvaraju, Qing Sun, Stefan Lee, David Crandall, Dhruv Batra
We observe that our method consistently outperforms Beam Search (BS) and previously proposed techniques for diverse decoding from neural sequence models.
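The diverse-decoding idea above can be sketched at its smallest scale: split the beam budget into groups that decode in sequence, and penalize each later group for re-picking tokens already chosen by earlier groups at the same step (a Hamming-style diversity term). This single-step, width-1-per-group sketch is an illustrative assumption, not the full algorithm.

```python
import numpy as np

def diverse_beam_step(logprobs, num_groups, diversity_strength):
    """One decoding step of diverse beam search with beam width 1 per group.

    logprobs: (vocab,) next-token log-probabilities shared by all groups.
    Returns the token index chosen by each group; later groups are
    penalized for repeating tokens that earlier groups already picked.
    """
    chosen = []
    penalty = np.zeros_like(logprobs)
    for _ in range(num_groups):
        tok = int(np.argmax(logprobs - diversity_strength * penalty))
        chosen.append(tok)
        penalty[tok] += 1.0  # discourage this token in subsequent groups
    return chosen
```

With the diversity strength at zero every group collapses onto the argmax token (ordinary greedy/BS behavior); a large strength forces the groups apart.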
no code implementations • NeurIPS 2015 • Qing Sun, Dhruv Batra
This paper formulates the search for a set of bounding boxes (as needed in object proposal generation) as a monotone submodular maximization problem over the space of all possible bounding boxes in an image.
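For a monotone submodular objective like the one named above, the standard greedy algorithm gives a (1 - 1/e) approximation guarantee. Below is a minimal sketch using a facility-location objective over IoU similarities, chosen here because it is a well-known monotone submodular function for coverage-style selection; the objective, function names, and toy boxes are illustrative assumptions, not the paper's exact formulation.

```python
import numpy as np

def iou(a, b):
    """Intersection-over-union of two boxes given as (x1, y1, x2, y2)."""
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2, y2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area = lambda r: (r[2] - r[0]) * (r[3] - r[1])
    return inter / (area(a) + area(b) - inter + 1e-9)

def greedy_box_selection(boxes, k):
    """Greedily maximize f(S) = sum_c max_{b in S} IoU(c, b).

    This facility-location objective is monotone submodular, so greedy
    selection is within (1 - 1/e) of the optimal set of k boxes.
    """
    n = len(boxes)
    sims = np.array([[iou(boxes[i], boxes[j]) for j in range(n)]
                     for i in range(n)])
    selected, cover = [], np.zeros(n)
    for _ in range(k):
        # Marginal gain of adding each candidate to the current set.
        gains = [np.maximum(cover, sims[j]).sum() - cover.sum()
                 for j in range(n)]
        j = int(np.argmax(gains))
        selected.append(j)
        cover = np.maximum(cover, sims[j])
    return selected
```

Given two heavily overlapping boxes and one isolated box, the greedy rule picks one representative from each cluster rather than both overlapping boxes, which is exactly the diminishing-returns behavior submodularity encodes.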