[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"article-deterministic-multicalibration-optimal-sample-use-en":3,"article-related-deterministic-multicalibration-optimal-sample-use-en":30,"series-research-66286461-18c3-42a2-a053-16a87b9a0dd0":73},{"id":4,"slug":5,"title":6,"content":7,"summary":8,"source":9,"source_url":10,"author":11,"image_url":12,"cover_image":12,"category":13,"language":14,"translated_content":11,"related_article_id":15,"keywords":16,"key_takeaways":22,"views":26,"created_at":27,"published_at":28,"topic_cluster_id":29},"66286461-18c3-42a2-a053-16a87b9a0dd0","deterministic-multicalibration-optimal-sample-use-en","Deterministic multicalibration finally hits optimal sample use","\u003Cp data-speakable=\"summary\">This paper shows multicalibration and omniprediction can be made deterministic without giving up optimal sample complexity.\u003C\u002Fp>\u003Cul>\u003Cli>\u003Cstrong>Research org\u003C\u002Fstrong>: Unspecified in arXiv abstract\u003C\u002Fli>\u003Cli>\u003Cstrong>Core data\u003C\u002Fstrong>: tilde O(epsilon^{-3}) sample complexity\u003C\u002Fli>\u003Cli>\u003Cstrong>Breakthrough\u003C\u002Fstrong>: Deterministic algorithm for minimax-optimal multicalibration\u003C\u002Fli>\u003C\u002Ful>\u003Cp>For engineers building decision systems, the practical question is not just whether a model is accurate, but whether its probabilities stay trustworthy across the slices and reweightings that matter in deployment. This paper tackles that problem at the level of calibration guarantees, and it matters because calibration is often the difference between a score you can operationalize and one you can only inspect.\u003C\u002Fp>\u003Cp>The big news here is simple: the authors show that you do not need randomization to get the best-known sample complexity for multicalibration. That closes an open question raised in prior work and gives a deterministic route to guarantees that were previously only known through randomized predictors.\u003C\u002Fp>\u003Ch2>What problem this paper is trying to fix\u003C\u002Fh2>\u003Cp>Multicalibration is a stronger version of calibration. A predictor is not only supposed to be unbiased overall; it should remain unbiased even after you condition on its own prediction and after you reweight the data by a collection of group weights G. In other words, the model should not just look calibrated in the aggregate while hiding systematic mistakes in subpopulations or under specific test reweightings.\u003C\u002Fp>\n\u003Cfigure class=\"my-6\">\u003Cimg src=\"https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781850768283-gcmj.png\" alt=\"Deterministic multicalibration finally hits optimal sample use\" class=\"rounded-xl w-full\" loading=\"lazy\" \u002F>\u003C\u002Ffigure>\n\u003Cp>That matters in downstream applications where predictions feed into decisions, ranking, resource allocation, or auditing. If a model says 0.8, engineers want that to mean something stable across relevant groups, not just on the full dataset. Multicalibration is one of the formal tools for making that promise precise.\u003C\u002Fp>\u003Cp>The paper focuses on a long-standing gap: before this work, the minimax-optimal tilde O(epsilon^{-3}) sample complexity rate for epsilon-multicalibration was only known for randomized predictors. Deterministic predictors existed, but with substantially worse sample complexity. The question was whether randomness was actually necessary to hit the optimal rate.\u003C\u002Fp>\u003Ch2>How the method works in plain English\u003C\u002Fh2>\u003Cp>The abstract does not spell out the full algorithmic machinery, so it does not let us reconstruct the implementation step by step. What it does tell us is the key structural result: the authors give a minimax-optimal multicalibration algorithm whose output predictor is deterministic.\u003C\u002Fp>\u003Cp>That is the core engineering idea. Instead of relying on stochastic prediction to achieve the calibration guarantee, the algorithm is designed so the final predictor itself is fixed and repeatable while still meeting the same sample-complexity target. For practitioners, that is appealing because deterministic outputs are easier to reason about, test, reproduce, and deploy.\u003C\u002Fp>\u003Cp>The authors then generalize the same approach beyond multicalibration. They extend it to outcome indistinguishability, or OI, with respect to finite or finitely covered collections of tests. From there, they derive deterministic omnipredictors and panpredictors with optimal sample complexity.\u003C\u002Fp>\u003Ch2>What the paper actually shows\u003C\u002Fh2>\u003Cp>The central result is a resolution of the open question about whether randomization is necessary for optimal multicalibration sample complexity. According to the abstract, the answer is no: the paper provides a deterministic predictor that achieves the minimax-optimal tilde O(epsilon^{-3}) rate for epsilon-multicalibration.\u003C\u002Fp>\n\u003Cfigure class=\"my-6\">\u003Cimg src=\"https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781850771729-0yef.png\" alt=\"Deterministic multicalibration finally hits optimal sample use\" class=\"rounded-xl w-full\" loading=\"lazy\" \u002F>\u003C\u002Ffigure>\n\u003Cp>The abstract does not provide \u003Ca href=\"\u002Ftag\u002Fbenchmark\">benchmark\u003C\u002Fa> tables, empirical results, or dataset-specific numbers. So there is no reported accuracy lift, runtime comparison, or wall-clock speedup to cite here. What we do have is a theoretical guarantee: the sample complexity is optimal in the minimax sense, and the predictor is deterministic.\u003C\u002Fp>\u003Cp>The second result is a broader guarantee for outcome indistinguishability. The paper says the algorithm can be adapted to finite or finitely covered collections of tests, which then yields deterministic omnipredictors and panpredictors with optimal sample complexity. The abstract frames this as resolving open problems posed in prior work.\u003C\u002Fp>\u003Cp>That matters because omniprediction and panprediction are not just niche theory terms; they are part of the broader toolkit for building predictors that behave well across a family of downstream tasks. If you are trying to design systems whose outputs can be reused safely across multiple decision rules, this kind of guarantee is exactly the sort of thing you care about.\u003C\u002Fp>\u003Ch2>Why developers should care\u003C\u002Fh2>\u003Cp>For a developer, deterministic guarantees are easier to operationalize than randomized ones. A deterministic predictor is simpler to debug, simpler to reproduce in audits, and less awkward to integrate into pipelines where repeated evaluations should not drift because of sampling noise.\u003C\u002Fp>\u003Cp>Multicalibration also speaks directly to fairness and reliability workflows. If your model is used on groups, slices, or test families, you want to know whether its confidence scores are meaningful after conditioning and reweighting. This paper says you can get that kind of guarantee without paying an extra sample-complexity penalty just because you insisted on determinism.\u003C\u002Fp>\u003Cp>There is also a broader architectural lesson here: sometimes what looks like a mathematical convenience, such as randomization, is not actually required for the best asymptotic guarantee. That can simplify deployment choices in settings where deterministic \u003Ca href=\"\u002Ftag\u002Finference\">inference\u003C\u002Fa> is preferable for governance, compliance, or reproducibility reasons.\u003C\u002Fp>\u003Ch2>Limitations and open questions\u003C\u002Fh2>\u003Cp>The abstract is strong on theory but thin on implementation detail. It does not tell us how the algorithm behaves in practice, how expensive it is computationally, or whether the deterministic construction is easy to implement at scale. Those are important questions for anyone thinking about production use.\u003C\u002Fp>\u003Cp>It also does not include empirical benchmarks, so there is no evidence here about real-world calibration error, latency, or memory use. The result is about sample complexity and theoretical optimality, not an end-to-end systems evaluation.\u003C\u002Fp>\u003Cp>Even so, the paper closes a clean theoretical loop. It answers a question explicitly raised in prior work: randomization is not necessary to reach the minimax-optimal sample complexity for multicalibration, and the same deterministic framing extends to OI, omniprediction, and panprediction.\u003C\u002Fp>\u003Cp>For anyone working on trustworthy ML infrastructure, that is a useful result to have in hand. It sharpens the boundary between what calibration theory requires and what was merely an artifact of earlier constructions.\u003C\u002Fp>\u003Ch2>Bottom line\u003C\u002Fh2>\u003Cp>This paper gives a deterministic path to optimal multicalibration and extends that guarantee to related prediction frameworks. The abstract does not give empirical benchmarks, but it does settle an open theoretical question that matters for how dependable prediction systems are built and audited.\u003C\u002Fp>\u003Cul>\u003Cli>Deterministic predictors can match the best-known tilde O(epsilon^{-3}) multicalibration sample complexity.\u003C\u002Fli>\u003Cli>The result extends to outcome indistinguishability, omniprediction, and panprediction.\u003C\u002Fli>\u003Cli>The abstract provides theory, not implementation details or empirical benchmarks.\u003C\u002Fli>\u003C\u002Ful>","This paper shows multicalibration and omniprediction can be made deterministic without giving up optimal sample complexity.","arxiv.org","https:\u002F\u002Farxiv.org\u002Fabs\u002F2606.20557",null,"https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781850768283-gcmj.png","research","en","ed7ed094-2671-4723-8105-a89dc805f8a9",[17,18,19,20,21],"multicalibration","deterministic predictors","omniprediction","calibration","sample complexity",[23,24,25],"Deterministic predictors can achieve minimax-optimal multicalibration sample complexity.","The method extends to outcome indistinguishability and related prediction frameworks.","The abstract reports theoretical results, not empirical benchmarks or implementation details.",0,"2026-06-19T06:32:28.768728+00:00","2026-06-19T06:32:28.76+00:00","3103988e-c4fe-45e3-98ab-846500c9d507",{"tags":31,"relatedLang":32,"relatedPosts":36},[],{"id":15,"slug":33,"title":34,"language":35},"deterministic-multicalibration-optimal-sample-use-zh","確定性多重校準終於達標","zh",[37,43,49,55,61,67],{"id":38,"slug":39,"title":40,"cover_image":41,"image_url":41,"created_at":42,"category":13},"405de39d-cfc5-43bf-b47b-ff9ce7be96a9","turboquant-does-not-hurt-search-quality-equal-bytes-en","TurboQuant does not hurt search quality at equal byte budgets","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781857967113-2xax.png","2026-06-19T08:32:22.235692+00:00",{"id":44,"slug":45,"title":46,"cover_image":47,"image_url":47,"created_at":48,"category":13},"6dc0410b-c9ec-4148-974b-0b5f7a14975c","uniego-proxy-teachers-egocentric-video-en","UNIEGO unifies egocentric video with proxy teachers","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781849887430-g735.png","2026-06-19T06:17:32.327109+00:00",{"id":50,"slug":51,"title":52,"cover_image":53,"image_url":53,"created_at":54,"category":13},"b398938d-f651-4d91-bfee-d888ba44fe6f","diffusiongemma-transparency-measured-en","DiffusionGemma’s transparency problem, measured","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781848969642-b497.png","2026-06-19T06:02:30.672396+00:00",{"id":56,"slug":57,"title":58,"cover_image":59,"image_url":59,"created_at":60,"category":13},"8abdf0aa-3fa8-4123-adec-4b0d3cd6b7de","nitro-split-kernel-isolation-math-en","Nitro’s split kernel turns isolation into math","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781843602176-04ij.png","2026-06-19T04:32:58.564142+00:00",{"id":62,"slug":63,"title":64,"cover_image":65,"image_url":65,"created_at":66,"category":13},"39d1ecdc-5ce6-45b7-af63-f1b74337311d","blackwell-wins-agentic-ai-infrastructure-benchmark-en","Blackwell wins because agentic AI needs full-stack infrastructure","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781803966380-s5kc.png","2026-06-18T17:32:18.823071+00:00",{"id":68,"slug":69,"title":70,"cover_image":71,"image_url":71,"created_at":72,"category":13},"d7f11606-750d-42ea-87b8-23a761269509","locus-local-ordinance-corpus-us-en","LOCUS opens U.S. local law for legal AI","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781764376812-ikxd.png","2026-06-18T06:32:30.210741+00:00",[74,79,84,89,94,99,104,109,114,119],{"id":75,"slug":76,"title":77,"created_at":78},"a2715e72-1fe8-41b3-abb1-d0cf1f710189","ai-predictions-2026-big-changes-en","AI Predictions for 2026: Brace for Big Changes","2026-03-26T01:25:07.788356+00:00",{"id":80,"slug":81,"title":82,"created_at":83},"8404bd7b-4c2f-4109-9ec4-baf29d88af2b","ml-papers-of-the-week-github-research-desk-en","ML Papers of the Week Turns GitHub Into a Research Desk","2026-03-27T01:11:39.480259+00:00",{"id":85,"slug":86,"title":87,"created_at":88},"87897a94-8065-4464-a016-1f23e89e17cc","ai-ml-conferences-to-watch-in-2026-en","AI\u002FML Conferences to Watch in 2026","2026-03-27T01:51:54.184108+00:00",{"id":90,"slug":91,"title":92,"created_at":93},"6f1987cf-25f3-47a4-b3e6-db0997695be8","openclaw-agents-manipulated-self-sabotage-en","OpenClaw Agents Can Be Manipulated Into Failure","2026-03-28T03:03:18.899465+00:00",{"id":95,"slug":96,"title":97,"created_at":98},"a53571ad-735a-4178-9f93-cb09b699d99c","vega-driving-language-instructions-en","Vega: Driving with Natural Language Instructions","2026-03-28T14:54:04.698882+00:00",{"id":100,"slug":101,"title":102,"created_at":103},"a34581d6-f36e-46da-88bb-582fb3e7425c","personalizing-autonomous-driving-styles-en","Drive My Way: Personalizing Autonomous Driving Styles","2026-03-28T14:54:26.148181+00:00",{"id":105,"slug":106,"title":107,"created_at":108},"2bc1ad7f-26ce-4f02-9885-803b35fd229d","training-knowledge-bases-writeback-rag-en","Training Knowledge Bases with WriteBack-RAG","2026-03-28T14:54:45.643433+00:00",{"id":110,"slug":111,"title":112,"created_at":113},"71adc507-3c54-4605-bbe2-c966acd6187e","packforcing-long-video-generation-en","PackForcing: Efficient Long-Video Generation Method","2026-03-28T14:55:02.646943+00:00",{"id":115,"slug":116,"title":117,"created_at":118},"675942ef-b9ec-4c5f-a997-381250b6eacb","pixelsmile-facial-expression-editing-en","PixelSmile Framework Enhances Facial Expression Editing","2026-03-28T14:55:20.633463+00:00",{"id":120,"slug":121,"title":122,"created_at":123},"6954fa2b-8b66-4839-884b-e46f89fa1bc3","adaptive-block-scaled-data-types-en","IF4: Smarter 4-Bit Quantization That Adapts to Your Data","2026-03-31T06:00:36.65963+00:00"]