[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"article-tether-turboquant-cuts-ai-memory-use-5x-en":3,"article-related-tether-turboquant-cuts-ai-memory-use-5x-en":30,"series-blockchain-0117641d-93d6-40f1-8b9e-158b8240493a":83},{"id":4,"slug":5,"title":6,"content":7,"summary":8,"source":9,"source_url":10,"author":11,"image_url":12,"cover_image":12,"category":13,"language":14,"translated_content":11,"related_article_id":15,"keywords":16,"key_takeaways":22,"views":26,"created_at":27,"published_at":28,"topic_cluster_id":29},"0117641d-93d6-40f1-8b9e-158b8240493a","tether-turboquant-cuts-ai-memory-use-5x-en","Tether’s TurboQuant cuts AI memory use 5x","\u003Cp data-speakable=\"summary\">Tether released \u003Ca href=\"\u002Ftag\u002Fturboquant\">TurboQuant\u003C\u002Fa> in QVAC SDK 0.12.0 to cut AI memory use by up to 5x.\u003C\u002Fp>\u003Cp>Tether’s Artificial Intelligence Research Group has released TurboQuant in production form, bundling it into \u003Ca href=\"https:\u002F\u002Ftether.io\" target=\"_blank\" rel=\"noopener\">Tether\u003C\u002Fa>’s QVAC SDK 0.12.0. The company says the open-source method, originally developed by Google Research, can reduce \u003Ca href=\"\u002Ftag\u002Fkv-cache\">KV cache\u003C\u002Fa> memory demands by up to five times for local AI workloads.\u003C\u002Fp>\u003Ctable>\u003Cthead>\u003Ctr>\u003Cth>項目\u003C\u002Fth>\u003Cth>數值\u003C\u002Fth>\u003C\u002Ftr>\u003C\u002Fthead>\u003Ctbody>\u003Ctr>\u003Ctd>Memory reduction claim\u003C\u002Ftd>\u003Ctd>Up to 5x\u003C\u002Ftd>\u003C\u002Ftr>\u003Ctr>\u003Ctd>SDK version\u003C\u002Ftd>\u003Ctd>QVAC SDK 0.12.0\u003C\u002Ftd>\u003C\u002Ftr>\u003Ctr>\u003Ctd>Model example\u003C\u002Ftd>\u003Ctd>4 billion parameters\u003C\u002Ftd>\u003C\u002Ftr>\u003Ctr>\u003Ctd>Context window example\u003C\u002Ftd>\u003Ctd>262,000 tokens\u003C\u002Ftd>\u003C\u002Ftr>\u003Ctr>\u003Ctd>KV cache memory example\u003C\u002Ftd>\u003Ctd>About 8 GB\u003C\u002Ftd>\u003C\u002Ftr>\u003Ctr>\u003Ctd>Four simultaneous sessions\u003C\u002Ftd>\u003Ctd>About 32 GB\u003C\u002Ftd>\u003C\u002Ftr>\u003C\u002Ftbody>\u003C\u002Ftable>\u003Ch2>What changed\u003C\u002Fh2>\u003Cp>TurboQuant targets one of the main bottlenecks in on-device AI: memory pressure from the KV cache, which stores context during long conversations and document analysis. Tether says the new compression approach preserves model quality while shrinking memory use enough to make longer local sessions practical on consumer hardware.\u003C\u002Fp>\n\u003Cfigure class=\"my-6\">\u003Cimg src=\"https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1780543069267-cwa3.png\" alt=\"Tether’s TurboQuant cuts AI memory use 5x\" class=\"rounded-xl w-full\" loading=\"lazy\" \u002F>\u003C\u002Ffigure>\n\u003Cp>The update is integrated into QVAC SDK 0.12.0 and tied into \u003Ca href=\"https:\u002F\u002Fgithub.com\u002Fggml-org\u002Fllama.cpp\" target=\"_blank\" rel=\"noopener\">Fabric\u003C\u002Fa>, a core part of the QVAC stack. Tether says the SDK packages the libraries, tools, runtime components, quantization pipelines, framework adapters, documentation, and workload profiles developers need to build local AI apps.\u003C\u002Fp>\u003Cul>\u003Cli>TurboQuant is now in production release.\u003C\u002Fli>\u003Cli>The code is open source and based on Google Research work.\u003C\u002Fli>\u003Cli>It is designed for laptops, smartphones, edge devices, and decentralized networks.\u003C\u002Fli>\u003Cli>Tether says it can help users inspect long documents without sending them to cloud servers.\u003C\u002Fli>\u003C\u002Ful>\u003Ch2>Why it matters\u003C\u002Fh2>\u003Cp>For developers, the pitch is simpler deployment of local AI tools that can handle larger contexts without expensive cloud \u003Ca href=\"\u002Ftag\u002Finference\">inference\u003C\u002Fa>. That matters for startups and independent teams trying to ship assistants, document tools, or edge apps without tying every request to a remote data center.\u003C\u002Fp>\n\u003Cfigure class=\"my-6\">\u003Cimg src=\"https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1780543065639-djq3.png\" alt=\"Tether’s TurboQuant cuts AI memory use 5x\" class=\"rounded-xl w-full\" loading=\"lazy\" \u002F>\u003C\u002Ffigure>\n\u003Cp>For users, Tether is framing the update around privacy and control. CEO Paolo Ardoino said people should be able to run long or sensitive tasks on their own devices instead of routing them through cloud infrastructure every time.\u003C\u002Fp>\u003Cp>The release also pushes Tether further into AI software, not just \u003Ca href=\"\u002Ftag\u002Fstablecoins\">stablecoins\u003C\u002Fa>. The company’s bet is that efficiency and portability will matter as much as raw compute for the next wave of AI products.\u003C\u002Fp>\u003Cp>The open question is whether TurboQuant becomes a useful local-AI building block or just another \u003Ca href=\"\u002Ftag\u002Fbenchmark\">benchmark\u003C\u002Fa> win that is hard to turn into real-world adoption.\u003C\u002Fp>","Tether released TurboQuant in QVAC SDK 0.12.0, claiming up to 5x lower AI memory use for local sessions on laptops and phones.","en.coin-turk.com","https:\u002F\u002Fen.coin-turk.com\u002Ftethers-latest-ai-upgrade-can-shrink-memory-by-up-to-5-times-what-do-investors-need-to-watch\u002F",null,"https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1780543069267-cwa3.png","blockchain","en","c054df55-967a-4a5a-8d7b-be8df18ee4a1",[17,18,19,20,21],"Tether","TurboQuant","local AI","QVAC SDK","KV cache",[23,24,25],"TurboQuant claims up to 5x lower memory use for AI sessions.","The feature ships in QVAC SDK 0.12.0 with local-AI tooling.","Tether is positioning privacy and on-device AI as the main use case.",0,"2026-06-04T03:17:20.409795+00:00","2026-06-04T03:17:20.403+00:00","309d224a-d257-420a-88f2-6167bf5c2b81",{"tags":31,"relatedLang":42,"relatedPosts":46},[32,34,36,38,40],{"name":21,"slug":33},"kv-cache",{"name":20,"slug":35},"qvac-sdk",{"name":17,"slug":37},"tether",{"name":19,"slug":39},"local-ai",{"name":18,"slug":41},"turboquant",{"id":15,"slug":43,"title":44,"language":45},"tether-turboquant-cuts-ai-memory-use-5x-zh","Tether TurboQuant 讓 AI 記憶體降 5 倍","zh",[47,53,59,65,71,77],{"id":48,"slug":49,"title":50,"cover_image":51,"image_url":51,"created_at":52,"category":13},"e02bdaf5-581b-4c86-bb4c-2a376546372f","coinstats-api-turns-crypto-data-into-one-stack-en","CoinStats API turns crypto data into one stack","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1780545810223-iayp.png","2026-06-04T04:03:07.140336+00:00",{"id":54,"slug":55,"title":56,"cover_image":57,"image_url":57,"created_at":58,"category":13},"6f84477d-86d3-4997-aa82-2eaaaeb3afbc","crypto-legality-by-country-banned-legal-unclear-en","Crypto legality by country: where it’s legal, banned, or unclear","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1780534976501-rh3i.png","2026-06-04T01:02:33.354717+00:00",{"id":60,"slug":61,"title":62,"cover_image":63,"image_url":63,"created_at":64,"category":13},"1f95a530-8605-4c0a-8eb2-ba616ec546f8","4-ways-us-bitcoin-perpetuals-could-reshape-crypto-en","4 ways U.S. bitcoin perpetuals could reshape crypto","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1780532275478-qmkc.png","2026-06-04T00:17:27.126342+00:00",{"id":66,"slug":67,"title":68,"cover_image":69,"image_url":69,"created_at":70,"category":13},"a4338994-d850-4553-9f60-989eb934316f","near-protocol-price-263-volume-jumps-en","NEAR Protocol price hits $2.63 as volume jumps","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1780413490805-7ww1.png","2026-06-02T15:17:39.109501+00:00",{"id":72,"slug":73,"title":74,"cover_image":75,"image_url":75,"created_at":76,"category":13},"b6374446-26e7-4506-ad12-42dc188e3762","gemini-ai-solana-price-prediction-june-2026-en","Gemini AI Sees Solana at $160 by June 2026","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1780265868744-bkwg.png","2026-05-31T22:17:26.72516+00:00",{"id":78,"slug":79,"title":80,"cover_image":81,"image_url":81,"created_at":82,"category":13},"79053d2b-fa5c-48e7-8155-9782db51f3ac","5-web3-applications-for-enterprise-teams-2026-en","5 Web3 Applications for Enterprise Teams in 2026","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1780246069845-5fhc.png","2026-05-31T16:47:21.759009+00:00",[84,89,94,99,104,109,114,119,124,129],{"id":85,"slug":86,"title":87,"created_at":88},"cdf2780b-1da6-4aca-a87b-f0974b815b03","moonpay-open-wallet-standard-ai-payments-en","MoonPay's Open Wallet Standard Targets AI Payments","2026-03-28T03:08:33.547032+00:00",{"id":90,"slug":91,"title":92,"created_at":93},"f06da3a4-3b15-4c7b-a250-6077505f5119","next-gen-crypto-simulators-ai-web3-training-en","Next-Gen Crypto Simulators Are Getting Smarter","2026-04-01T09:36:34.200192+00:00",{"id":95,"slug":96,"title":97,"created_at":98},"0794f597-b908-402a-b660-729034ffdbf6","rtk-cuts-claude-code-token-spend-en","RTK cuts Claude Code token spend fast","2026-04-01T10:24:29.50277+00:00",{"id":100,"slug":101,"title":102,"created_at":103},"5101ffbf-7ea9-4baa-b5e2-64729ff55b20","openclaw-flaw-exposes-ai-admin-hijack-risk-en","Openclaw Flaw Exposes AI Admin Hijack Risk","2026-04-01T13:12:33.481569+00:00",{"id":105,"slug":106,"title":107,"created_at":108},"fadea65e-f7c8-41b0-a186-809d21787b4c","how-web3-marketing-changed-in-2026-en","How Web3 Marketing Changed in 2026","2026-04-02T01:36:36.504086+00:00",{"id":110,"slug":111,"title":112,"created_at":113},"88f88741-ff27-41d1-8151-776d0afb9508","ai-agentic-defi-web3-grants-march-2026-en","AI, Agentic DeFi, and Web3 Grants to Watch","2026-04-02T05:51:37.696422+00:00",{"id":115,"slug":116,"title":117,"created_at":118},"43fafe43-772e-48c8-bb95-da8d64cf60e3","why-crypto-is-fixated-on-ai-agents-en","Why Crypto Is Fixated on AI Agents","2026-04-02T05:54:29.121481+00:00",{"id":120,"slug":121,"title":122,"created_at":123},"320ef5e4-fe56-47ab-9a92-290d6fbd3f60","web3-explained-what-it-is-why-it-matters-en","Web3 Explained: What It Is and Why It Matters","2026-04-02T06:15:33.001112+00:00",{"id":125,"slug":126,"title":127,"created_at":128},"f49cffaf-2c57-4f48-9486-7062cca91ba0","trust-wallet-ai-trading-agents-220m-users-en","Trust Wallet Adds AI Trading Agents for 220M Users","2026-04-02T06:24:28.043029+00:00",{"id":130,"slug":131,"title":132,"created_at":133},"2b8501e2-39af-4de3-ade1-29616a58e9fb","trust-wallet-agent-kit-ai-trade-25-chains-en","Trust Wallet's Agent Kit Lets AI Trade on 25+ Chains","2026-04-02T06:27:33.425312+00:00"]