Papers Read on AI

Keeping you up to date with the latest trends and best performing architectures in this fast evolving field…

Episodes

Total: 205

Despite Large Language Models (LLMs) like GPT-4 achieving impressive results in function-level code generation…

In this work, we introduce Unique3D, a novel image-to-3D framework for efficiently generating high-quality…

We present DeepSeek-Coder-V2, an open-source Mixture-of-Experts (MoE) code language model that achieves…

The conventional recipe for maximizing model accuracy is to (1) train multiple models with various hyperparameters…

There are two common ways in which developers are incorporating proprietary and domain-specific data…

Software engineers are increasingly adding semantic search capabilities to applications using a strategy…

Language agents perform complex tasks by using tools to execute each step precisely. However, most existing…

To extend the context length of Transformer-based large language models (LLMs) and improve comprehension…

Retrieval Augmented Generation (RAG) enhances the abilities of Large Language Models (LLMs) by enabling…

Simultaneous speech-to-speech translation (Simul-S2ST, a.k.a. streaming speech translation) outputs target speech…

We introduce VASA, a framework for generating lifelike talking faces with appealing visual affective skills…

The misuse of large language models (LLMs) has drawn significant attention from the general public and…

Large Language Models (LLMs) are often described as being instances of foundation models - that is, …

We introduce Buffer of Thoughts (BoT), a novel and versatile thought-augmented reasoning approach for…

Knowledge Graphs (KGs) represent human-crafted factual knowledge in the form of triplets (head, relation, tail)…

We introduce AutoCoder, the first Large Language Model to surpass GPT-4 Turbo (April 2024) and GPT-4o…

With impressive achievements made, artificial intelligence is on the path forward to artificial general intelligence…

Generative pre-trained large language models (LLMs) have demonstrated impressive performance over a wide range of tasks…

Large language models (LLMs) often generate content that contains factual errors when responding to…

End-to-end transformer-based detectors (DETRs) have shown exceptional performance in both closed-set…