Guide

How to control your AI coding agent from your phone

How to control your AI coding agent from your phone

Here's the situation an AI coding agent puts you in that nothing before it did: you kick off a real task — "refactor the auth module, run the tests, open a PR" — and then there's nothing for you to do for ten minutes except wait. So you walk away. You make coffee, you take the dog out, you sit down to dinner. And somewhere in those ten minutes the agent finishes, or worse, stops and asks "can I run git push?" — and just sits there, blocked, until you're back at the keyboard.

The fix is to take the agent with you. Not the laptop — your phone. If you can watch the terminal stream live on your phone, type a follow-up, and tap "allow" on a prompt while you're nowhere near your desk, the dead time disappears. This guide covers what "remote control" actually has to do, the DIY ways to get there, where they fall short, and how to do it without putting your code on the open internet.

Desktop agent runs here claude / cursor Encrypted relay sees ciphertext only never your code Phone view stream type commands approve / deny
The agent keeps running on your desktop. A relay forwards the encrypted session to your phone, which can watch, type, and answer prompts — without the relay ever seeing your code in the clear.

Why you'd want to leave your desk mid-task

Agentic coding has a rhythm: a burst of describing what you want, then a long stretch where the agent works and you don't. That stretch is the whole point — it's the time the tool buys you back. But it's only yours if you can actually leave. If "leaving" means the agent silently blocks on the first prompt and you lose twenty minutes, you end up tethered to the desk anyway, watching a terminal do nothing. That's the exact babysitting problem in a different costume.

Phone control changes the math. The agent's long task overlaps with your life instead of pausing it. You check the stream from the couch, you fire the next task from the kitchen, you approve the push from the bus. The work continues; you just stop being the bottleneck.

What "remote control" actually has to do

It's tempting to think a screenshot or a "done" notification is enough. It isn't. A real remote setup has four jobs, and most DIY approaches only do one or two:

The DIY options, and where they break

You can absolutely rig this up yourself. People do. Each path gets you part of the way:

The pattern across all of these: the ones that let you actually type and approve are the ones that put a live shell on the network, and securing that is the hard part you inherit.

Doing it securely

This is the part worth slowing down on, because the failure mode is severe. A remotely reachable terminal is a remotely reachable terminal — if it's exposed and weakly authenticated, it's an open door to your whole machine. Three rules keep you safe:

Where Backgrind fits

Backgrind's Live mode is this guide, built in. The agent runs on your desktop in Backgrind's always-on-top overlay exactly as it normally would; Live mode mirrors that session to your phone or iPad through the browser. You watch the terminal stream in real time, send commands, switch between sessions, and answer "needs you" prompts — it's drive, not a read-only mirror.

The security model is the one above, by default. Pairing is desktop-initiated with a one-time code (scan a QR or paste it), the link is end-to-end encrypted with a per-pairing key, and the relay only ever forwards ciphertext — your code never sits readable on our server. There's no separate web account to create and nothing extra to trust.

Pair Live mode with several agents running at once and the whole loop comes together: tasks running in parallel on your machine, a chime when one needs a decision, and your phone as the remote that answers it. See it in the live demo.