Domain 4 · Task Statement 4.1
Computer Use Fundamentals
TL;DR
Understand how Computer Use enables Claude to see and control your screen, master the per-app permission model, learn the tool priority hierarchy, and recognise the screenshot-analyse-act execution cycle.
What You Need to Know
Computer Use is where Cowork crosses the line from processing files and querying APIs to physically driving your computer. When enabled, Claude can see your screen, move your cursor, click buttons, type text, and navigate between applications — the same actions you perform yourself, executed remotely by Claude through periodic screenshots and simulated input.
This isn't a metaphor. Claude literally takes a photograph of your display, analyses what it sees, decides where to click, moves the cursor, and repeats the cycle. It's the most powerful capability in Cowork, and the most dangerous if used carelessly. Understanding exactly how it works — and when not to use it — is the core of Domain 4.
Enabling Computer Use
Computer Use is disabled by default and available only on Pro and Max individual plans. You enable it in Settings > General via a toggle switch. This is a deliberate opt-in: Claude never touches your screen without your explicit decision to turn this on.
Exam Trap: Plan Availability
Computer Use requires a Pro or Max individual plan. It isn't currently available on Team or Enterprise plans. Any exam scenario that describes an organisation-wide deployment of Computer Use on a Team plan contains a false premise — check the plan tier before evaluating the rest of the question.
How Claude sees your screen
Claude doesn't have direct access to your operating system, application APIs, or the accessibility layer. Instead, it perceives your screen by taking periodic screenshots and analysing the image. It reads text, identifies buttons, recognises UI elements, and determines what's visible — all through visual interpretation of a photograph.
This has a critical security implication: anything visible on your screen is visible to Claude. That includes the target application you want Claude to work in, but also every background window, notification banner, browser tab, and desktop icon. If your email shows a confidential client contract behind the app Claude is working in, that information is captured in the screenshot and processed by the model.
Operational control: the screenshot-analyse-act cycle
Once Claude can see the screen, it controls your computer through a repeating cycle:
- Screenshot — Claude captures an image of your current display
- Analyse — the model interprets what it sees: text, buttons, fields, menus, layout
- Act — Claude moves the cursor, clicks, types, scrolls, or navigates based on its analysis
- Repeat — another screenshot is taken to observe the result, and the cycle continues
This cycle is inherently slower than connector-based operations. A Connector delivers structured data in seconds through an API. Computer Use must photograph the screen, process the image, decide on an action, execute it, wait for the UI to respond, and photograph again. A task that takes two seconds via a Connector might take two minutes via Computer Use.
The per-app permission model
Computer Use follows a zero-trust model. There's no "allow everything" switch. Each time Claude needs to interact with a new application, it must request your explicit permission. Approve Calculator, and Claude can use Calculator. That approval doesn't extend to Safari, Mail, Finder, or any other application. Every app is a separate door that requires its own key.
Exam Trap: No Global Permission Toggle
A common distractor claims that Claude can bypass per-app permission prompts by enabling a "Global Permission" toggle in Settings. This doesn't exist. The per-app model is non-negotiable — Claude must request and receive approval for each individual application, every time.
This means the permission overhead scales with the number of applications involved in a task. A simple task using one app requires one approval. A complex workflow spanning five apps requires five separate approvals. Factor this into your workflow planning.
The tool priority hierarchy
Claude follows an efficiency hierarchy when deciding how to complete a task:
- Connectors — structured API access, fastest and most reliable (pre-prepped ingredients in the fridge)
- Browser — web navigation, slower but works for sites without Connectors (the pantry down the hall)
- Computer Use — screen interaction, slowest but handles any visible application (walking out to the garden)
This hierarchy isn't just a performance recommendation — it reflects reliability too. Connectors deliver precise, structured data with near-100% success. Browser navigation is less reliable because websites change layout, load slowly, or require authentication. Computer Use has roughly a 50% success rate on complex multi-step tasks because it depends on visual interpretation of an ever-changing screen.
The practical rule: always use the highest-priority tool available for each step. If a Google Calendar Connector is installed, never ask Claude to visually navigate the Calendar website. If a Salesforce Connector exists, never ask Claude to drive the Salesforce desktop app via screen clicks. Reserve Computer Use for applications that have no API, no web interface, and no Connector — legacy desktop software that can only be operated through a GUI.
Computer Use operates outside the sandbox
This is the most important distinction to grasp. File processing, Skills, and Connectors all run inside Cowork's sandboxed environment. Computer Use doesn't. It interacts with your actual desktop, your physical mouse and keyboard, your real applications. There's no VM, no container, no safety net beyond the per-app permission model and the blocked application categories.
This is why Domain 2's folder scoping and Domain 3's Connector-based workflows are preferred: they operate within controlled boundaries. Computer Use breaks those boundaries by design — which is precisely what makes it both powerful and risky.
Common Mistakes
Common Mistake
Leaving sensitive windows open in the background — email, banking, password managers, confidential documents — and starting a Computer Use session without realising Claude screenshots the entire screen.
Instead: Before any Computer Use session, minimise or close every window that contains sensitive information. Claude captures the entire visible display, not just the target application. Treat the screen like a shared workspace: if you wouldn't want a colleague to see it, close it.
Common Mistake
Starting a Computer Use task and then closing the laptop lid or letting the screen lock — expecting Claude to continue working in the background.
Instead: Computer Use depends on a live, visible screen. If the display goes dark, the application closes, or the machine sleeps, Claude loses its ability to see and interact. Keep the computer awake and the application open for the entire task duration.
Common Mistake
Using Computer Use to check Google Calendar or query Gmail when a Connector for that service is already installed — burning minutes on screen navigation that a Connector could complete in seconds.
Instead: Always follow the tool priority hierarchy: Connectors first, browser second, Computer Use last. Check which Connectors are installed before reaching for screen interaction. If a Connector exists for the service, use it.
Checking your calendar
Before
Check my calendar for next week.
After
Use the Google Calendar Connector to retrieve all events from 7 April to 11 April 2026 and list them with times and locations.
Working with a desktop application
Before
Fix my budget spreadsheet.
After
Open Excel, find the file '2026_Budget.xlsx' in my Documents folder, go to the Expenses tab, highlight all empty rows in Column C with yellow fill, and save the file.
Hands-On Activity
Hands-On Activity
Enable Computer Use and Test Per-App Permissions
Enable Computer Use, trigger the per-app permission prompt, observe the screenshot-analyse-act cycle in real time, and understand the speed difference between Computer Use and connector-based operations.
What you will learn
- Enable Computer Use from Settings and understand its plan requirements
- Trigger and approve a per-app permission prompt
- Observe the screenshot-analyse-act execution cycle in real time
- Compare the speed of Computer Use against connector-based alternatives
- 01
Verify you're on a Pro or Max plan by checking Settings > Subscription in Claude Desktop. Ensure you're running the latest version of the app.
Why: Computer Use is only available on Pro and Max plans and requires the latest app version. Attempting to enable it on a lower plan will fail silently.
Expected: Your subscription tier displayed as Pro or Max, and the app version matching the latest release.
- 02
Navigate to Settings > General and enable the Computer Use toggle.
Why: Computer Use is disabled by default — this is an intentional safety measure so Claude never controls your screen without your explicit opt-in.
Expected: The Computer Use toggle switched to the 'on' position.
- 03
In a Cowork conversation, type: 'Open Calculator and compute 147 multiplied by 23. Tell me the result.'
Why: Calculator is a safe, low-risk application to test Computer Use. This task verifies that Claude can see the screen, request per-app permission, interact with UI elements, and read the result.
Expected: Claude requests permission to interact with Calculator. After you approve, you should see Claude move the cursor, click the number buttons, and report the result (3,381).
- 04
Watch the execution carefully: observe how Claude takes screenshots, moves the cursor, clicks buttons, and reads the screen. Note the slight delay between each action.
Why: Understanding the screenshot-analyse-act cycle helps you set realistic expectations for task speed and explains why Computer Use is the slowest tool in the priority hierarchy.
Expected: A visible sequence of cursor movements and clicks with brief pauses between each action as Claude takes a new screenshot and decides the next step.
Practice Question
Practice Question
You are at a conference and need a critical report exported from a legacy desktop-only accounting application on your laptop back at the hotel. The application has no API or web interface. Your laptop is open with Claude Desktop running. How should you retrieve the report?
Sources
- Let Claude use your computer in Cowork — Anthropic
- Put Claude to work on your computer — Anthropic
- Claude Dispatch: Control Cowork From Your Phone — FindSkill.ai