Keeping you up to date with the latest trends and best performing architectures in this fast evolving field.
Advancements in model algorithms, the growth of foundational models, and access to high-quality datasets have propelled the evolution of AI.
Low-rank adaptation is a popular parameter-efficient fine-tuning method for large language models. It freezes the pretrained weights and trains only small low-rank update matrices injected into selected layers, sharply reducing the number of trainable parameters.
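For readers who want intuition for how this works, here is a minimal PyTorch sketch of a LoRA-style layer. It is illustrative only: the rank r=8 and scaling alpha=16 are arbitrary example values, and production implementations additionally handle dropout and merging the update back into the base weights.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen linear layer plus a trainable low-rank update: y = Wx + (alpha/r) * B(Ax)."""
    def __init__(self, in_features, out_features, r=8, alpha=16):
        super().__init__()
        self.base = nn.Linear(in_features, out_features)
        self.base.weight.requires_grad_(False)  # pretrained weights stay frozen
        self.base.bias.requires_grad_(False)
        self.A = nn.Parameter(torch.randn(r, in_features) * 0.01)  # low-rank down-projection
        self.B = nn.Parameter(torch.zeros(out_features, r))        # zero-init: training starts exactly at W
        self.scale = alpha / r

    def forward(self, x):
        return self.base(x) + self.scale * (x @ self.A.T) @ self.B.T

layer = LoRALinear(768, 768)
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
print(trainable)  # only the small A and B matrices are trained
```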
We present an AutoML system called LightAutoML developed for a large European financial services company and its ecosystem.
In the past year, Multimodal Large Language Models (MLLMs) have demonstrated remarkable performance in tasks such as visual question answering, visual understanding, and reasoning.
We argue that representations in AI models, particularly deep networks, are converging. First, we survey many examples of convergence in the literature: over time and across multiple domains, the ways by which different neural networks represent data are becoming more aligned.
Generative foundation models are susceptible to implicit biases that can arise from extensive unsupervised training data. Such biases can produce suboptimal samples, skewed outcomes, and unfairness, with potentially serious consequences.
Penetration testing, an essential component of software security testing, allows organizations to proactively identify and remediate vulnerabilities in their systems.
State-of-the-art computer vision systems are trained to predict a fixed set of predetermined object categories. This restricted form of supervision limits their generality and usability since additional labeled data is needed to specify any other visual concept.
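Approaches such as CLIP sidestep this by learning from natural-language supervision, after which classification reduces to comparing image and text embeddings. The sketch below shows only the shape of that zero-shot pipeline; encode_image and encode_text are random stand-ins for trained encoders, not a real model.

```python
import numpy as np

rng = np.random.default_rng(0)

def encode_image(image):
    # stand-in for a trained image encoder; returns an L2-normalized embedding
    v = rng.standard_normal(512)
    return v / np.linalg.norm(v)

def encode_text(text):
    # stand-in for a trained text encoder
    v = rng.standard_normal(512)
    return v / np.linalg.norm(v)

labels = ["cat", "dog", "car"]
prompts = [f"a photo of a {c}" for c in labels]
text_emb = np.stack([encode_text(p) for p in prompts])

img_emb = encode_image("any image")
logits = 100.0 * text_emb @ img_emb  # scaled cosine similarities
probs = np.exp(logits - logits.max())
probs /= probs.sum()
print(labels[int(np.argmax(probs))])  # the highest-similarity prompt wins
```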
Motivated by recent advances in large language models for Natural Language Processing (NLP), we design a time-series foundation model for forecasting whose out-of-the-box zero-shot performance on a variety of public datasets comes close to the accuracy of state-of-the-art supervised forecasting models for each individual dataset.
As AI promises to accelerate scientific discovery, it remains unclear whether fully AI-driven research is possible.
We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128K tokens.
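For readers unfamiliar with MoE layers, the sketch below shows the basic top-k routing idea that makes such sparse activation possible. It is a generic illustration, not DeepSeek-V2's actual layer, which additionally uses fine-grained and shared experts plus Multi-head Latent Attention.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopKMoE(nn.Module):
    """Each token is routed to its top-k experts; outputs are gate-weighted sums."""
    def __init__(self, dim, num_experts=8, k=2):
        super().__init__()
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim))
            for _ in range(num_experts)
        )
        self.router = nn.Linear(dim, num_experts)
        self.k = k

    def forward(self, x):  # x: (tokens, dim)
        gates, idx = self.router(x).topk(self.k, dim=-1)
        gates = F.softmax(gates, dim=-1)
        out = torch.zeros_like(x)
        for slot in range(self.k):          # only k of the experts run per token
            for e in range(len(self.experts)):
                mask = idx[:, slot] == e
                if mask.any():
                    out[mask] += gates[mask, slot].unsqueeze(-1) * self.experts[e](x[mask])
        return out

moe = TopKMoE(dim=64)
print(moe(torch.randn(10, 64)).shape)  # torch.Size([10, 64])
```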
Large Language Models (LLMs) trained on code are revolutionizing the software development process. Increasingly, code LLMs are being integrated into software development environments to improve the productivity of human programmers.
This paper considers image-based virtual try-on, which renders an image of a person wearing a curated garment, given a pair of images depicting the person and the garment, respectively.
For recent diffusion-based generative models, maintaining consistent content across a series of generated images, especially those containing subjects and complex details, presents a significant challenge.
Large Language Models (LLMs) have catalyzed significant advancements in Natural Language Processing (NLP).
Inspired by the Kolmogorov-Arnold representation theorem, we propose Kolmogorov-Arnold Networks (KANs) as promising alternatives to Multi-Layer Perceptrons (MLPs). While MLPs have fixed activation functions on nodes ("neurons"), KANs have learnable activation functions on edges ("weights").
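To make the node-versus-edge contrast concrete, here is a toy layer in the KAN spirit. As a simplification of the paper's B-spline parameterization, each edge's learnable 1-D function is a linear combination of fixed Gaussian bumps.

```python
import torch
import torch.nn as nn

class ToyKANLayer(nn.Module):
    """Each input-output edge applies its own learnable 1-D function; nodes just sum."""
    def __init__(self, in_dim, out_dim, num_basis=6):
        super().__init__()
        # fixed Gaussian bump centers/width shared by all edges
        self.register_buffer("centers", torch.linspace(-2, 2, num_basis))
        self.width = 0.5
        # one coefficient vector per edge: (out_dim, in_dim, num_basis)
        self.coef = nn.Parameter(torch.randn(out_dim, in_dim, num_basis) * 0.1)

    def forward(self, x):  # x: (batch, in_dim)
        # evaluate the shared basis at every input: (batch, in_dim, num_basis)
        basis = torch.exp(-((x[..., None] - self.centers) / self.width) ** 2)
        # phi_{o,i}(x_i) = sum_k coef[o,i,k] * basis[:,i,k]; node o sums over edges i
        return torch.einsum("bik,oik->bo", basis, self.coef)

layer = ToyKANLayer(3, 2)
print(layer(torch.randn(5, 3)).shape)  # torch.Size([5, 2])
```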
While many contemporary large language models (LLMs) can process lengthy input, they still struggle to fully utilize information within the long context, known as the lost-in-the-middle challenge.
In this report, we introduce InternVL 1.5, an open-source multimodal large language model (MLLM) to bridge the capability gap between open-source and proprietary commercial models in multimodal understanding.
In the realm of mimicking human deliberation, large language models (LLMs) show promising performance.
The quadratic complexity and weak length extrapolation of Transformers limit their ability to scale to long sequences, and while sub-quadratic solutions like linear attention and state space models exist, they empirically underperform Transformers in pretraining efficiency and downstream task accuracy.
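For intuition about one common sub-quadratic ingredient, the sketch below implements plain chunk-wise attention, where each token attends only within its fixed-size chunk, so cost grows linearly with sequence length. This is a generic illustration and omits the gating and moving-average machinery that an architecture like Megalodon layers on top.

```python
import torch
import torch.nn.functional as F

def chunked_attention(q, k, v, chunk=128):
    """Attention restricted to fixed-size chunks: O(n * chunk) instead of O(n^2)."""
    n, d = q.shape
    out = torch.empty_like(v)
    for s in range(0, n, chunk):
        e = min(s + chunk, n)
        scores = q[s:e] @ k[s:e].T / d ** 0.5  # (chunk, chunk) block only
        out[s:e] = F.softmax(scores, dim=-1) @ v[s:e]
    return out

n, d = 1024, 64
q, k, v = (torch.randn(n, d) for _ in range(3))
print(chunked_attention(q, k, v).shape)  # torch.Size([1024, 64])
```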