YuyaoGe's Website
YuyaoGe's Website
About
Highlight Publications
Other Publications
Projects
Posts
Light
Dark
Automatic
Home
Tags
Academic
Academic
Focusing by Contrastive Attention: Enhancing VLMs' Visual Reasoning
Vision-Language Models (VLMs) have demonstrated remarkable success across diverse visual tasks, yet their performance degrades in …
Yuyao Ge 葛钰峣
,
Shenghua Liu
,
Yiwei Wang
,
Lingrui Mei
,
Baolong Bi
,
Xuanshan Zhou
,
Jiayu Yao
,
Jiafeng Guo
,
Xueqi Cheng
PDF
Cite
DOI
arXiv
Are All Prompt Components Value-Neutral? Understanding the Heterogeneous Adversarial Robustness of Dissected Prompt in Large Language Models
Abstract: Prompt-based adversarial attacks have become an effective means to assess the robustness of large language models (LLMs). However, existing approaches often treat prompts as monolithic text, overlooking their structural heterogeneity-different prompt components contribute unequally to adversarial robustness.
Yujia Zheng
,
Tianhao Li
,
Haotian Huang
,
Tianyu Zeng
,
Jingyu Lu
,
Chuangxin Chu
,
Yuekai Huang
,
Ziyou Jiang
,
Qian Xiong
,
Yuyao Ge 葛钰峣
,
Mingyang Li
Aug 3, 2025
PDF
Cite
DOI
arXiv
Can Graph Descriptive Order Affect Solving Graph Problems with LLMs?
In this work, we propose a Dynamically Adaptive Density Control Strategy based on the degree of reconstruction of the background of the scene, which adaptive the spatial sample point generation strategy dynamically according to the training results and prevents the generation of redundant data in the model.
Yuyao Ge 葛钰峣
,
Shenghua Liu
,
Baolong Bi
,
Yiwei Wang
,
Lingrui Mei
,
Wenjie Feng
,
Lizhe Chen
,
Xueqi Cheng
PDF
Cite
Code
Slides
Video
DOI
Paper
Website
Poster
A Survey of Context Engineering for Large Language Models
The performance of Large Language Models (LLMs) is fundamentally determined by the contextual information provided during inference. …
Lingrui Mei
,
Jiayu Yao
,
Yuyao Ge 葛钰峣
,
Yiwei Wang
,
Baolong Bi
,
Yujun Cai
,
Jiazhi Liu
,
Mingyu Li
,
Zhong-Zhi Li
,
Duzhen Zhang
,
Chenlin Zhou
,
Jiayi Mao
,
Tianze Xia
,
Jiafeng Guo
,
Shenghua Liu
PDF
Cite
Code
DOI
arXiV
Hugging Face
Who is in the Spotlight: The Hidden Bias Undermining Multimodal Retrieval-Augmented Generation
Abstract: Multimodal Retrieval-Augmented Generation (RAG) systems have become essential in knowledge-intensive and open-domain tasks. As retrieval complexity increases, ensuring the robustness of these systems is critical. However, current RAG models are highly sensitive to the order in which evidence is presented, often resulting in unstable performance and biased reasoning, particularly as the number of retrieved items or modality diversity grows.
Jiayu Yao
,
Shenghua Liu
,
Yiwei Wang
,
Lingrui Mei
,
Baolong Bi
,
Yuyao Ge 葛钰峣
,
Zhecheng Li
,
Xueqi Cheng
May 30, 2025
PDF
Cite
DOI
arXiv
PIS:Linking Importance Sampling and Attention Mechanisms for Efficient Prompt Compression
Abstract: Large language models (LLMs) have achieved remarkable progress, demonstrating unprecedented capabilities across various natural language processing tasks. However, the high costs associated with such exceptional performance limit the widespread adoption of LLMs, highlighting the need for prompt compression.
Lizhe Chen
,
Binjia Zhou
,
Yuyao Ge 葛钰峣
,
Jiayi Chen
,
Shiguang Ni
Apr 23, 2025
PDF
Cite
DOI
arXiv
a1: Steep Test-time Scaling Law via Environment Augmented Generation
Large Language Models (LLMs) have made remarkable breakthroughs in reasoning, yet continue to struggle with hallucinations, logical …
Lingrui Mei
,
Shenghua Liu
,
Yiwei Wang
,
Baolong Bi
,
Yuyao Ge 葛钰峣
,
Jun Wan
,
Yurong Wu
,
Xueqi Cheng
Apr 20, 2025
PDF
Cite
DOI
Innate Reasoning is Not Enough : In-Context Learning Enhances Reasoning Large Language Models with Less Overthinking
TBD
Yuyao Ge 葛钰峣
,
Shenghua Liu
,
Yiwei Wang
,
Lingrui Mei
,
Lizhe Chen
,
Baolong Bi
,
Xueqi Cheng
PDF
Cite
arXiv
Translating Words to Worlds: Zero-Shot Synthesis of 3D Terrain from Textual Descriptions Using Large Language Models
Abstract: The current research on text-guided 3D synthesis predominantly utilizes complex diffusion models, posing significant challenges in tasks like terrain generation. This study ventures into the direct synthesis of text-to-3D terrain in a zero-shot fashion, circumventing the need for diffusion models.
Guangzi Zhang
,
Lizhe Chen
,
Yu Zhang
,
Yan Liu
,
Yuyao Ge 葛钰峣
,
Xingquan Cai
Jul 1, 2024
PDF
Cite
DOI
Frequency-Importance Gaussian Splatting for Real-Time Lightweight Radiance Field Rendering
In this work, we propose a Dynamically Adaptive Density Control Strategy based on the degree of reconstruction of the background of the scene, which adaptive the spatial sample point generation strategy dynamically according to the training results and prevents the generation of redundant data in the model.
Lizhe Chen
,
Yan Hu
,
Yu Zhang
,
Yuyao Ge 葛钰峣
,
Haoyu Zhang
,
Xingquan Cai
Feb 21, 2024
PDF
Cite
DOI
»
Cite
×