{"ID":2888170,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2507.23170","arxiv_id":"2507.23170","title":"BAR Conjecture: the Feasibility of Inference Budget-Constrained LLM Services with Authenticity and Reasoning","abstract":"When designing LLM services, practitioners care about three key properties: inference-time budget, factual authenticity, and reasoning capacity. However, our analysis shows that no model can simultaneously optimize for all three. We formally prove this trade-off and propose a principled framework named The BAR Theorem for LLM-application design.","short_abstract":"When designing LLM services, practitioners care about three key properties: inference-time budget, factual authenticity, and reasoning capacity. However, our analysis shows that no model can simultaneously optimize for all three. We formally prove this trade-off and propose a principled framework named The BAR Theorem...","url_abs":"https://arxiv.org/abs/2507.23170","url_pdf":"https://arxiv.org/pdf/2507.23170v2","authors":"[\"Jinan Zhou\",\"Rajat Ghosh\",\"Vaishnavi Bhargava\",\"Debojyoti Dutta\",\"Aryan Singhal\"]","published":"2025-07-31T00:51:16Z","proceeding":"cs.LG","tasks":"[\"cs.LG\"]","methods":"[\"Large Language Model\"]","has_code":false}