-
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU
Paper • 2502.08910 • Published • 150 -
From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation up to 100K Tokens
Paper • 2502.18890 • Published • 30 -
MPO: Boosting LLM Agents with Meta Plan Optimization
Paper • 2503.02682 • Published • 29 -
SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents
Paper • 2505.20411 • Published • 96
Jeffrey Yang Fan Chiang
RandomHakkaDude
AI & ML interests
GenAI, LLMs
Recent Activity
liked a dataset about 9 hours ago
badlogicgames/pi-mono upvoted a paper 7 months ago
DynaGuard: A Dynamic Guardrail Model With User-Defined Policies liked a model 11 months ago
nvidia/Nemotron-4-340B-Instruct