Perplexity will push hybrid reasoning functionality
Coinpaper
23h ago
Ai Focus
Perplexity plans to launch hybrid inference in its Windows app in July, allowing some AI tasks to run locally on the user's device while the rest are handled by cloud-based models.
Helpful
No.Help

At Computex Taipei 2026, Perplexity announced a new feature: Perplexity Computer for Windows, scheduled for release in July. This system automatically determines which parts of an AI task run locally and which are handled by cloud-based models, eliminating the need for users to manually switch modes.

Sensitive content is processed locally first.

This solution was jointly announced by Perplexity CEO Aravind Srinivas and Intel CEO Liwu Chen. The company calls it a hybrid local-server inference orchestration system, which focuses on handling privacy, performance, and computing costs within the same process.

Perplexity states that data such as financial records, health information, and personal documents are best handled by a lightweight model on the device, which determines whether to keep them locally. Data requiring more complex reasoning is then sent to a larger model in the cloud for processing.

According to the company, tasks such as document summarization, text formatting, and lightweight classification can be completed locally; complex reasoning is handled by the server. The entire process switches automatically during task execution, minimizing the user's awareness.

However, this does not mean that Perplexity offers users a fully controllable offline model. The local components remain a compact model of Perplexity integrated into the application, and the cloud portion still runs through the Perplexity server; therefore, it cannot be considered a completely offline solution.

Cost pressure is an important background factor.

In an interview during Computex, Srinivas stated that the goal of AI systems should be to provide higher "value per watt" for each user, rather than concentrating all computing power on servers and the largest models. He noted that some companies are already spending hundreds of millions of dollars per month on computing power.

Perplexity previously disclosed that its revenue had increased from $100 million to $500 million, while its headcount had only grown by 34%. In this context, offloading some inference workloads to user computers can directly reduce cloud computing costs.

This is also one of the key reasons why the AI industry is currently pushing for edge-based inference. For businesses, local operation reduces server costs; for users, it means that some sensitive data does not have to leave the device.

The industry is shifting towards end-side and hybrid models.

Currently, many technology companies are advancing local or hybrid inference. Apple is performing some sensitive processing on its local chips; Microsoft's Foundry Local, which officially became available in April of this year, supports local AI inference on Windows, macOS, and Linux.

NVIDIA also launched RTX Spark at Computex, targeting local large model inference on laptops and desktop devices. In contrast, Perplexity's difference lies not in the model itself, but in the scheduling layer: the system determines the division of labor between local and cloud in real time based on the task, rather than allowing users to pre-select.

Perplexity stated that this feature is not limited to Intel chipsets. While the demonstration used an Intel Core Ultra Series 3 processor, Nvidia processors are also supported. Currently, the feature is only confirmed to launch initially on Windows PCs; release dates for other platforms have not yet been announced.

Tip
$0
Like
0
Save
0
Views 687
CoinMeta reminds readers to view blockchain rationally, stay aware of risks, and beware of virtual token issuance and speculation. All content on this site represents market information or related viewpoints only and does not constitute any form of investment advice. If you find sensitive content, please click“Report”,and we will handle it promptly。
Submit
Comment 0
Hot
Latest
No comments yet. Be the first!
Related
RedotPay extends XRP payment and transfer functionality
RedotPay expands XRP payment and transfer capabilities to cover everyday spending, international remittances, collateralized credit, and fiat currency exchange.
Coinpedia
·2026-05-29 16:34:53
374
Stanford study: AI outperforms law professors in legal reasoning.
A Stanford study found that law professors more often chose AI-generated answers in blind tests of contract law questions, with models such as Gemini and NotebookLM generally outperforming human answers.
Coinpaper
·2026-06-04 05:07:03
654
RLUSD integrates with Wormhole's native cross-chain transfer functionality.
RLUSD achieves native multi-chain transfers through Wormhole NTT, and the market is paying attention to its impact on the circulation of cross-chain stablecoins and the activity of the Wormhole ecosystem.
CoinPedia
·2026-06-04 23:58:00
117
US crypto lobbying groups push for the CLARITY bill.
The new PAC in the United States has intervened in the discussion of the Clarity Act, with developer protection, ethical provisions, and limits on stablecoin yields becoming the focus.
AMBCrypto
·2026-06-05 00:27:45
813
Coinbase intensifies its push for the passage of the Clarity Act.
Coinbase is pushing for the CLARITY Act, a bill to regulate the US crypto market, but it still needs 60 votes in the Senate to pass.
Cryptonews
·2026-06-02 05:14:19
142