Talking to Claude Code and Codex

16 Jul 202616 Jul 2026 ~ Jon Udell ~ Leave a comment

Handwriting was always problematic for me. My fifth-grade teacher, Mrs. Cloud, placed a high value on well-formed cursive strokes that she flowed smoothly onto the blackboard. At my desk I struggled to copy her examples and failed miserably. In middle school, with no one judging my handwriting, I abandoned cursive in favor of printing my letters which was slower but at least I could read what I wrote.

In college, needing to take notes faster than I could print them, I forced myself to relearn cursive. In the 1970s my portable device wasn’t a laptop computer, it was an electric typewriter that I used only for final copy. I composed in longhand on yellow legal pads. By the early 1980s, when it finally became possible to compose on a computer, I thought I’d left handwriting behind forever. Take that, Mrs. Cloud! No more clumsy scribbling with pen and paper! Or so I thought, until the keyboard began to take its toll.

My struggles with RSI began in the waning days of BYTE magazine. I’d been working obsessively for months to complete a subscriber version of byte.com. Just as I was ready to launch it, CMP bought BYTE from McGraw-Hill only to shut us down immediately. I went home, began writing Practical Internet Groupware, and soon realized I’d done real damage to my hands and wrists. So for the rest of that summer I wrote longhand on a series of yellow legal pads.

This was ironic because my beloved Captain Kirk keyboard, later memorialized in the New York Times, was the most ergonomic typing setup there’s ever been before or since.

But even those keystrokes got to be too much. When I advocated for blogging as a mode of communication that optimizes for the amount of awareness and influence that each keystroke can possibly yield, the subtext was relief for my aching hands.

Voice input was always the dream. Periodically I would try the latest version of Dragon Naturally Speaking but it never worked fluently for prose and was hopeless for code. Until fairly recently, RSI-challenged programmers went to extraordinary lengths to code by voice. In this 2013 video Tavis Rudd demoed a method that required a huge specialized vocabulary to express commands, functions, variables, punctuation, and cursor movement. Where there’s a will there’s a way, but I knew that wasn’t for me.

A few months ago, as I began developing Bram, I realized it was time to give voice recognition another try. If you’ve used any form of it recently you’ve noticed the improvement as the rising tide of AI lifts all boats. It’s gotten way easier to dictate prose reliably. And now, suddenly, that’s also a way to produce code. You don’t have to express commands, functions, variables, and punctuation, you describe outcomes and monitor agents that do most of the writing and editing. I connected Bram to a Whisper server and the results have been dramatic. Here’s what went into last night’s v0.2.22.

– Self-heal stuck “delete pending” session rows
– Add a New session button to the Sessions page
– Render Supabase execute_sql as pretty SQL in, table out
– Add a Skills launcher to the agent pane (#221)
– Default continueLast to on so restarts resume the session
– Instrument session rotation so it names its own cause
– Force a full transcript fetch on window-miss to survive session rotation
– Observe-only: flag user-interrupt-after-permission turn ends
– Fix send-ledger false-strand across a session switch
– Render Codex exec-wrapped apply_patch as a diff
– Observe-only: flag send-ledger false-strands across a session rollover
– Use a browser-safe HTTP URL in the Target app info dialog
– Toast when a Push auto-closes issues
– Auto-close issues on push; remove agent close route (security H5, #118)
– Revert “Host-authorize issue close side effects (security H5, #118)”
– Host-authorize issue close side effects (security H5, #118)
– Widen the H4 authorization TTL to fit implementation time
– Parse Codex unified-exec (custom_tool_call name=exec) tool cards

All this required almost no typing, I just used my voice to direct Claude Code and Codex to do the research, coding, and testing. Take that, Mrs. Cloud! No more clumsy scribbling, and no more typing either. Finally I can build software by just talking to the computer. It really is a dream come true.

Small models can solve big problems

12 Jul 202612 Jul 2026 ~ Jon Udell ~ Leave a comment

In this snapshot of the Bloomington calendar you can see that events are neatly categorized.

This was an intractable problem a dozen years ago. Should the county fair land in community / social or family/kids? It’s not a critical choice, and as a user of the calendar you’d accept either. For the calendar’s curator, though, hundreds or thousands of such choices add up to an unsustainable cognitive burden.

In the Before Time you could imagine a function that takes in event titles and descriptions and uses regexes and word lists to map an event to a category. But that was unsustainable too. What we always needed, and now can have, is a function that requires no procedural code to effect that mapping. My LLM-assisted community calendar reboot calls Anthropic’s Haiku to categorize events.

It costs less than a penny a day to relieve the curator of this cognitive burden. With an agent in the loop, of course, curators must have final say. So I built an override mechanism that enables a switch from, say, family/kids to community/social. It also records those overrides and feeds them into future classifications. That seemed important but to my knowledge it has rarely if ever been used, Haiku’s mappings do the job well enough.

Tagging individual events is a poor use of a curator’s time and effort. You’d rather just encourage people and organizations to write good titles and descriptions for their events. Procedural code can’t enable that but a low-powered LLM can.

Don’t infer behavior from code, observe it in logs

8 Jul 20268 Jul 2026 ~ Jon Udell ~ Leave a comment

Agents are hardwired to be prolific writers and readers of code. As my work on Bram progressed I found that their code-first instinct wasn’t serving me well. So I began pushing them to be, also, prolific writers and readers of logs.

Bram is a Tauri app, so it’s written in Rust. But it’s also a JavaScript app that hosts a terminal where Claude Code and Codex run, and it’s an XMLUI app that reimagines how to display and interact with those terminal-based agents, and it’s a workflow governed by a set of Markdown files and Python hooks. The app’s behavior arises from the dynamic interplay of these layers, languages, and components.

Was the right message sent to the agent at the right time? Did the rule-defined workflow transition occur? Did the agent’s response render correctly? These are observations about runtime behavior. When something goes wrong, the drill is now:

– Do we have the instrumentation to know what happened?

– If no, add it.

– If yes, use it.

This applies as much to developing new features as it does to debugging existing ones. For example, Bram tracks the TUI (text user interface) menus that Claude Code and Codex present, and renders them as GUI menus. It was arguably foolish to even try this kind of screenscraping. Web pages (when not delivered as minified JavaScript) have structure that, while prone to change, is easy to target. Tap into a TUI and you’re looking at a stream of content bytes intermixed with control characters. It’s the source of truth, but a hard one to reason about. So we began gathering evidence.

The ladder of evidence

JSONL session files are the final record. But it can take a few seconds for activity to show up there, and they mainly preserve conversation not interaction. So Bram recruits three other layers: PTY, grid, and hook.

PTY input

These are bytes read from the terminal process, i.e. what the TUI sent.

[2026-07-08T13:46:16.407Z] [pty-in] gap_ms=0 bytes=202 preview=”\x1b[?2026h\x1b[18;2H…”

Fields:

– gap_ms: milliseconds since the previous PTY input chunk.
– bytes: raw byte count for this chunk.
– runs: optional, count of repeated/compactable control runs.
– preview: escaped prefix of raw bytes. ANSI/control characters are preserved as escapes like \x1b, \r, \x07.

The xterm.js grid

The PTY stream isn’t just text, it’s an instruction set for painting a terminal: move the cursor, clear regions, set colors, write characters, update the title, enter or leave bracketed paste mode. Bram uses xterm.js to render those bytes to a terminal grid, then reads the resulting screen state.

– [grid-menu] op=report provider=claude count=3 parsed_offset=446235 [1.Yes | 2.Yes, and don’t ask again for: awk -F’]’ ‘$1 >= “[2026-07-07…”‘ | 3.No]

– [grid-menu] op=build-claude-nosig tool=Bash grid_count=3 cmd=”grep -E \”hook-menu|retire-suppressor\” bram-trace.log | tail…” grid=[1.Yes | 2.Yes, and don’t ask again for: … | 3.No]

The grid layer answers questions that raw PTY bytes cannot answer directly:

– What rows are visible right now?
– Which text is inside the permission box?
– Which option labels are present?

This is the layer where TUI screenscraping becomes tractable. It’s not regexes, it’s programmatic inspection of a reconstructed terminal screen.

PTY Output

These are bytes Bram writes into the terminal.

[2026-07-08T13:45:24.145Z] [pty-out] bytes=18 preview=”claude –continue\r” is_structured=false caller_hint=agent-autostart

Fields:

– bytes: number of bytes sent.
– preview: escaped text sent to the PTY.
– is_structured: whether it came from a structured Bram intent path (propose → apply → commit).
– caller_hint: why/where the write originated.

Hooks

Claude Code and Codex both fire lifecycle hooks when using menus to ask permission. Bram’s hook scripts relay those as structured JSON, timestamped into the same trace:

– [hook-menu] op=permission provider=claude tool=Edit options=3
– [hook-menu] op=payload tool=Edit body=”{\”tool_input\”:{\”file_path\”:\”src-tauri/src/lib.rs\”,\”old_string\”:…,\”new_string\”:…},\”permission_suggestions\”:[…]}”
– [worklist-guard] tool=Edit target=docs/esc-resend-redesign.md decision=deny reason=no-coverage-no-opt-out

The hook-menu trace reports a tool name, its full input, and the permission options the TUI is about to draw.

All the layers

PTY logs preserve messy reality: control bytes, cursor movement, bracketed paste markers, title updates, spinner frames. The grid layer turns that byte stream into visible terminal state. Hooks bypass reconstruction entirely, but only for some cases. The JSONL file describes final truth, but again only for some cases. Altogether the traces combine raw, reconstructed, and declared evidence. Interpretation taps into one or several of the layers as it needs to.

From evidence to construction

I can now mostly run Bram in GUI mode without looking at the terminal. Occasionally something gets stuck, so I’m toying with the notion of auto-opening the terminal when it needs attention. Is that reliably knowable? That wasn’t a question the logs could answer so I’ve added new instrumentation. After a day of normal use I’ll know whether the feature is even feasible, and if so, how an agent should build it.

Deciphering the traces

The schemas for these log entries have evolved organically. In the Before Time I’d have worried about that. Would the logs be amenable to structured query? If not, I’d need to write a one-off script to answer each question and that was unsustainable.

But for agents, writing one-off scripts is like breathing and Bram’s rendering makes that respiration more visible.

In “What is the terminal?” I showed how agents wield the repertoire of command-line tools to make your wishes come true. I see that happening constantly as they reach for awk, grep, sed, and perl to interpret Bram’s logs. Why awk or grep in one case, sed or perl in another? LLMs are nondeterministic but if there is logic that governs these choices I’d love to know what it is.

Baked-in log-first development

As this method evolved, Claude Code and Codex absorbed it into their stored memories. That was convenient, I could just ask “Do we have the instrumentation to support that?” and they’d do the right thing. But those memories aren’t shared between agents, never mind across the various repository-backed projects where Bram can run.

As I was writing this post I found that the log-first imperative was, in fact, only recorded in private agent memories. Now it’s baked into any project where Bram runs your agents.

“What is the terminal?”

1 Jul 20261 Jul 2026 ~ Jon Udell ~ 1 Comment

In his keynote talk at the first Perl conference, Larry Wall couldn’t get the Windows computer on the podium to behave. So he SSH’d into his own machine and said, with relief and joy: “Home sweet home”.

Three decades on, software developers still live in the terminal, now more than ever as coding agents dethrone the integrated environments that held sway for so long. IDEs recede as we do less writing and editing, more reading and reviewing. If you watch developers at work today, you are likely to see them in the terminal at a command prompt.

It’s not your grandfather’s command prompt, though, it’s a terminal-based agent like Claude Code or Codex. These agents are maestros of the underlying command shell; they wield its powers far more effectively than most of us can. If you care to, this is a great way to learn by doing. Don’t take a course or watch a video to learn about git, just watch how agents use it in all its glorious complexity.

But what if you don’t care about those commands? What if you’ve never opened a terminal? The genesis of Bram was my experience helping non-coders use Claude Code. I sat them in front of my computer with two windows side-by-side: the agent in a terminal on the left, the app it was building in a browser on the right. These folks were delighted to be able to ask the agent for features and see those features appear after a browser refresh. But they did not enjoy reading the terminal to try making sense of what Claude Code was doing and saying.   

Bram started as a way to manage the side-by-side windows in a single self-contained app. As workflow emerged, the terminal remained the primary way to view and interact with the agent. What would it take to augment the terminal with a more readable display? That idea moved forward in fits and starts as I learned more about the layers involved: the session file, the pseudo-terminal (PTY), xterm.js, and agent hooks. It was hooks that finally unlocked instant and reliable recognition of the permission menus shown in the Claude Code and Codex TUIs (text user interfaces). But all the layers participate in making it possible, now, to operate Bram in full GUI mode with the terminal closed.

If you are a terminal jockey you may enjoy the more legible display of: agent messages, your messages, pasted screenshots, diffs, tool calls and results. But when I introduce non-coders to agent-assisted coding the first question is usually: “What is the terminal?”  My answer: “It’s where the agent runs the commands needed to do what you want it to do.” For me, over the past few days, the list includes:

awk, bash, bc, cargo, cat, cd, chmod, claude, codex, cp, curl, cut, date, diff, echo, exit, find, gh, git, grep, head, jq, ls, nl, node, paste, perl, pgrep, php, printf, ps, pwd, python3, rg, rm, ruby, rustfmt, sed, seq, set, sh, shasum, sleep, sort, source, sqlite3, stat, sw_vers, sysctl, tail, test, touch, tr, true, uniq, uptime, wc, whoami, zsh

These humble commands — I love that perl makes the list! — always were the foundation of computing. That hasn’t changed. What has is that newcomers are running them, indirectly, as they talk with agents to summon software into existence. For many, the terminal is a foreign and hostile environment. Now it’s optional. If you know and love the terminal it’s there in the left pane. If you’d rather not look at it, Bram offers a friendlier way to work with Claude Code and Codex in a git/GitHub repository.

“Doctor, it hurts when agents create unreviewable PRs.” “Don’t do that.”

28 Jun 202628 Jun 2026 ~ Jon Udell ~ 2 Comments

I recently attended a talk, by an engineer at a large software company, on the topic of unreviewable PRs. The problem? When agents raise PRs with thousands of lines of LLM-written adds/deletes/edits, people can’t make sense of them. The solution? Throw more agents at the problem: reviewer agents that scan what coding agents have produced, identify problems, and triage them.

I don’t make software at industrial scale, so I can’t evaluate the claim that throughput gain justifies the absence of end-to-end human engagement. What I can say is that as I use Bram to bootstrap itself, I am fully engaged thanks to the workflow embodied in the tool.

Here’s the breakdown of languages in Bram.

Language	Lines of code
Rust	24,630
JavaScript	7,542
XMLUI	4,149
Python	3,152
Markdown	1,419
XS (XMLUI)	742
Total	42,805

Bram is a Tauri desktop app, Tauri’s native language is Rust, so Rust — a language I never touched before this project — dominates. I have yet to write a single line of Rust! But I read the Rust code that Claude Code and Codex write for me, as they write it. I understand the nature and purpose of that code, and I push back when things don’t smell right.

Bram’s workflow helps do that by breaking problems into small testable chunks and processing them in an orderly way. That’s hardly a novel idea. In the LLM era we are finding new reasons to honor old best practices. We’ve always said that documentation is an essential part of the product, for example, but we haven’t always made it so. Now that readers include both people and machines we invest more effort in the docs. Why not also invite LLMs to join us in conventional agile practices?

Enriched local context

When we invite these new partners onboard, how do we orient them? Chat sessions build context that’s private to LLMs, not shared with a team of people and agents. Bram lifts that context into two kinds of shared spaces: the local worklist and the GitHub repository. On the local worklist you define a task or feature, iterate on its spec, do the task or build the feature, and iterate on outcomes. The worklist item lives in the local repo and, whether tracked or not, provides context shared between you and Claude Code, and maybe with Codex too. As shown here, it’s a one-click operation to switch between agents so one can weigh in on a plan or implementation written by the other. Here I’m about to bring in Claude as a relief pitcher.

One of the delightful emergent properties of this system has been the evocative names that agents create for worklist items. Naming is famously hard. I could conjure a name like startup-freeze-tail-fanout-diagnostics on my own but these names aren’t public-facing, they are perfectly serviceable, there is no reason for me to bear the cognitive load of creating them.

Bram records a searchable history of worklist items so my agents and I can refer to them.

Our human context windows can handle about five to seven things at a time, so I prune the worklist accordingly. If other things come up that bump the priority of startup-freeze-tail-fanout-diagnostics I can use the Drop button to clear it from the worklist. Then I can refind it on the History page, perhaps by searching for fanout, and ask the active agent to resurrect it as a new worklist item.

Human Agent in the loop

I dislike the phrase “human in the loop” because it cedes authority to the machines. Let’s flip the narrative. It’s our loop, we work the same way we always have, now we recruit agents to join the team. An agent-assisted process need not be a black box that takes in prompts and emits features.

I’m reminded of a beautiful idea of Brian Marick’s that Ward Cunningham once implemented and demoed to me. Brian called it visible workings. Ward’s implementation made an Eclipse Foundation workflow visible. When the UI presented a form, it added an Explore button that you could use to inspect the business rule that motivated the form.

Let’s do agentic software development like that. Not as a loop we’ve been excluded from, instead as one we invite agents into.

Vibe coding as a team sport

17 Jun 202617 Jun 2026 ~ Jon Udell ~ 3 Comments

In Working With Intelligent Machines, written at the beginning of my AI-assisted coding journey, I quoted from Garry Kasparov’s The Chess Master and the Computer.

The winner was revealed to be not a grandmaster with a state-of-the-art PC but a pair of amateur American chess players using three computers at the same time. Their skill at manipulating and “coaching” their computers to look very deeply into positions effectively counteracted the superior chess understanding of their grandmaster opponents and the greater computational power of other participants. Weak human + machine + better process was superior to a strong computer alone and, more remarkably, superior to a strong human + machine + inferior process.

Bram, the tool I’m building to support that kind of teamwork, puts a UI next to Claude Code and Codex and guides them through a workflow that’s anchored to git for version control and GitHub for collaboration.

A UI companion for the terminal

Here’s a picture of of me using Bram to build a standalone voice transcription app. Bram itself is a desktop app that wires together a terminal where you run Claude Code or Codex (on the left), a companion UI for them (bottom right), and the app you are developing (top right).

At the moment this screenshot was captured I was testing the first iteration of my transcription app, and discussing with Claude Code how the app will manage its own the Whisper server.

Bram’s UI puts a microphone next to several input boxes so you can capture voice, and it uses Whisper to transcribe what you say. For me this is transformative. I’ve long struggled with repetitive stress and reducing my keystroke load really helps. Now, as I use Bram to develop Bram — as well as the apps I build with it — I rarely have to type.

The UI echoes agent responses more readably than they appear in the terminal. It reports recent tool uses as a compact list of links that you can open to see tool calls and results. And when you paste a screenshot, it displays the image so you can see what you and the agent are talking about. (When you paste an image into the terminal, it just appears as [Image #1].)

Guardrails for vibe coders

The workflow brings a few layers of structure to the conversation that you’re having with agents. The guardrails are optional, but by default Bram wants you to put items on a worklist. In the software world these are often called stories on the backlog, but I’m not assuming that someone who’s using Bram will be familiar with that tradition. LLMs are bringing a lot of people to coding who have never coded before, and have never touched a terminal or git or GitHub. For them, Bram aims to be an on-ramp to these disciplines.

You ask Bram to file a new worklist item by giving it a brief description of what you want to do. Or you choose an open issue from the GitHub repository that Bram runs in, and it builds a worklist item based on that issue. The item shown in the screenshot is whisper-server-lifecycle.

However you ask Bram to create it, the new item appears with a before and after section. The before section describes the current state of play. The after section says how things will be when the plan becomes code. It lists options considered, justifies the chosen one, cites prior art in the code or in related GitHub issues, and outlines ways to verify that the changes yield the desired result.

Now the item waits at the To-Apply gate, one of two approval gates in the workflow. For either, your choices are Approve, Iterate, or Drop. If you Approve at the To-Apply gate, Bram implements the plan and advances to the next approval gate, To-Commit. But you might want to click the Iterate button and refine the plan document. You can do this as much as you want.

Flexible workflow

Often, as you iterate, you and/or your agent will realize that the current item touches other parts of the system, or suggests new ideas, or raises concerns that you haven’t considered. You can ask Bram to capture these tangents as new worklist items or GitHub issues.

Sometimes after you iterate for a while you realize that the item just doesn’t make sense — maybe not now, maybe never. Use the Drop button to remove the item from the worklist. The history is retained; you and agents can review and search that history.

Another way to preserve an item you’re not ready to take forward: ask Bram to promote it to a GitHub issue that carries the plan of record. You can bring it back later as a new issue-derived item.

During each iterate cycle you’re typing (or in my case speaking) into an input box where you can attach one or more screenshots — incredibly helpful if you’re building UI.

When ready to advance an item, click Approve. Up to this point Bram has made no changes to tracked files in the repository. Now it begins to do so, constrained by a self-created list of the files it expects to touch.

At this point you’re in the familiar loop where Claude Code is shenaniganing and wibbling or Codex is doing its equivalent. Perhaps, depending on your permission settings, they are prompting to make tool calls. Bram’s UI helps here in a couple of ways. When a tool asks permission to use an awk command with a long gnarly string of arguments, the command is easier to read than in the terminal. (Caveat: accurate parsing of the menus presented by the Claude Code and Codex TUIs — text user interfaces — is a work in progress!) And when the agent proposes a change, the diff is easier to read than in the terminal. But the terminal is right there and I tend to keep eye on it, the companion UI just gives you more to see and do. While an agent is thinking, and you are waiting, you can open and review the item’s plan. You can switch over to the Issues tab and review what’s going on there. You can review tool calls to see more clearly what’s happening under the hood. You can create and iterate new worklist items.

The team dimension

Implementing the review and approval lifecycle in a way that’s reliable, and works identically for Claude Code and Codex, has proven to be an interesting challenge. For me it’s not an either/or thing. I’ve always found it valuable to consult multiple LLMs and play one off against the other. Inside Bram, quite often, I switch from Claude Code to Codex, or vice versa, and ask one to weigh in on a worklist item, commit, or issue that was touched by the other.

That’s one aspect of the kind of teamwork that I’ve often talked about in my series of posts on working with LLMs. I regard them as a team of assistants and, until recently, I would often copy a transcript from one and paste it into the other. Now, with Bram, agents can see more than the code and documentation in the repository. They can see and react to plans on the worklist and discussion in related GitHub issues. This is useful even if you’re operating as a solo developer, because information that would otherwise be squirreled away in hidden files seen only by one agent or another are now visible to, and searchable by, all of them. It leads to amusing interactions:

“Hey Claude, grab that evidence from the log and post it along with a comment on issue 185 so Codex can weigh in.”

“Hey Codex, look at what Claude said, what’s your take?”

Bram enables this by guiding agents to the gh commands that can not only create and edit issues but also post and edit comments on issues. When you work this way your team now includes not only you and your agents but also your human team members and their agents. Bram is a solo project right now, but when I am working on XMLUI (which powers the Bram UI) this communication is directed to the whole team. To clarify who’s talking, Bram encourages agents to introduce themselves: “This is Jon’s Codex speaking, Jon asked me to weigh in”.

As the person directing the agents, you are now in a position to curate outboard context that’s available to the whole team. If you’ve worked with agents, you know that they can often be quite verbose. You get to decide how much to include. Usually I tell agents to begin with an executive summary for the benefit of people, but include full details for the benefit of other agents who will happily read and absorb this additional context.

Just enough ceremony

In How to make best use of git and GitHub for AI-assisted software development I showed how agents can wield command-line tools like git and gh on our behalf. If you use these tools regularly you may not appreciate how much tacit knowledge you’ve acquired. Without LLM help no newbie would stand a chance. But even veterans, if they are honest, will admit that these tools are byzantine and cumbersome, and that it’s a great relief to use them fluently without having to remember command syntax.

My collaborator on this project, Andrew Schulman, is using Bram to develop a tool for code analysis. When I showed him that first post about git and Github he said: “You’re underselling the workflow.” It’s early days, and things are evolving quickly, but we are both certain that we are far more effective with this workflow than without it. LLMs. Bram is already complex, Andrew’s code exam is even more complex. With Bram we feel we are bringing order to the chaos of vibe coding and managing complexity that we otherwise would not be able to handle.

I’ll let Andrew speak for himself but for me it’s about having just enough ceremony for the task at hand. If you’re doing a small thing, like changing one line of code or tweaking a piece of documentation, you can tell Bram to skip the worklist and roll your tweak into an open item or an unpushed commit. If it’s a bigger thing, you want — or you should want, and Bram wants you to have — more structure. The worklist enforces ceremony in a local and transient way, GitHub enforces it in a shared and permanent way, and work can flow in both directions as needed. For people and their agents, this is how vibe coding becomes a team sport.

How to make best use of git and GitHub for AI-assisted software development

2 Jun 20262 Jun 2026 ~ Jon Udell ~ Leave a comment

I’m working on a new tool whose tagline is the title of this post: Make best use of git and GitHub for AI-assisted software development. Called Bram (“Bram runs agents mindfully”), the tool runs as a Tauri desktop app with three panes: a terminal where you use Claude Code and/or Codex, an agent pane that embodies a workflow (rendered by XMLUI), and an app pane that hot-reloads the app you are developing. The workflow is pretty standard. Things you are working on show up on the Worklist and pass through three phases: proposed → applied → committed. The arrows between the phases are approval gates where you can dwell and iterate with your agents on what you are planning to build, or what you have built and are testing.

Bram expects you to be working in a git repository that’s hosted on GitHub, and it helps you manage a stream of issues and commits. This matters for at least three reasons.

1. It encourages agents to enact a git/gh-centric workflow that makes otherwise chaotic agent-assisted development feel safe, orderly, and accountable.

2. It helps you think clearly about the work you are doing, and proceed in well-defined chunks and sequences.

3. It makes context durable in GitHub, so prior work (and discussion about work) is available to people and agents as new work intersects with old. For example, agents can use comments on issues as architectural decision records.

This is possible because agents are really good at wielding git and GitHub on your behalf. Not long ago I had to stop and think about something as simple as git pull –rebase. Now I can easily perform feats that I rarely attempted before, like hunk-level staging and unstaging. That sounds abstract but here is the concrete need. When you propose a Worklist item, Bram figures out which files are likely to be involved. As you iterate on the proposal that list may grow or shrink. You can have multiple items in the proposed phase, before any code has been written. A second proposal might yield an overlapping list. In that case, Bram alerts you to a tradeoff. You may want to sequence the two items to avoid a merge conflict. In the Before Time that would always have been my choice, because merge conflicts were nightmares for me. I knew it was possible to untangle overlapping commits but I also knew the mechanics would likely defeat me or, even if I prevailed, would destroy my momentum. Now Bram warns about entanglement and gives me a choice. If I toggle between active work items I know I’ll incur merge cost, but the agents’ mastery of git mechanics makes it a reasonable trade-off.

Challenging git mechanics made easy

I asked Claude Code to review our recent sessions and highlight some of the ways that Bram has guided me to effective uses of git.

1. Hunk-level staging (`git add -p` and friends). Composing a focused commit out of a messy working tree by accepting / rejecting individual hunks. The mechanical cost is real — you sit through every hunk, type y/n/s/e, and if you split wrong you start over. Most developers default to `git add .` and live with sprawling commits. Bram does the patience work on your behalf and lands clean, atomic commits.

2. Squash-by-soft-reset (`git reset –soft HEAD~N && git commit`). Turning two consecutive WIP commits into one clean commit without touching the working tree. The flag combinations are intimidating (`–soft` vs `–mixed` vs `–hard`), and getting it wrong loses work. Most developers reach for `git rebase -i`, which requires an interactive editor and breaks in non-interactive contexts. Bram applies the soft-reset pattern as documented in the project conventions — no editor, no panic.

3. History archaeology (`git log -G ‘<regex>’`, `git show <sha>:<path>`). Finding when a string first appeared or disappeared from the codebase, or reading a deleted file at the revision before it was removed. The flags (`-G`, `-S`, `:<path>` ref-spec) are obscure enough that most developers never learn them and instead grep the working tree and miss the history. Bram uses them as the default first move when investigating a regression — “when did this break” becomes a one-liner instead of a half-hour bisect.

These uses are not gratuitous. In the month since its inception Bram has become the most complex piece of software I’ve ever produced. It would not have been possible without git fluency that I was never able to achieve but can now delegate to agents.

Challenging GitHub mechanics made easy

Bram expects that, in addition to git, you have also installed gh, the command-line interface to GitHub. Here are some of the ways Bram has guided me to effective uses of gh (again, courtesy of Claude Code’s session introspection).

1. `gh api` with `–paginate` and `–jq`. Hand-rolled REST queries against the GitHub API with pagination handled and JSON filtered down to exactly the fields you want — e.g. “all open issues across these five repos with label X, formatted as TSV.” Doing this without `gh` means `curl` + Bearer-token auth + manual `Link:` header parsing for pagination + a separate `jq` invocation, and any one of those steps deters most developers from starting. With `gh api –paginate … –jq …` it’s a single shell line; Bram composes them routinely for cross-issue analytics that would be impractical to do by hand.

2. Filtered listing and search (`gh issue list –search ‘…’`, `gh search code`). GitHub’s search syntax (`is:open label:bug -author:dependabot updated:>2026-05-01`) is powerful but finicky enough that hand-typing it is error-prone. The web UI search box is fine for one-offs but doesn’t compose into a script. Bram drops the right `–search` string in once, pipes through `–json` / `–jq`, and the result feeds the next decision — the kind of “show me everything that matches X, then triage” loop that’s tedious to do by clicking.

3. Multi-line body composition with `–body-file`. Authoring a rich issue or PR body (tables, fenced code blocks, embedded diffs) in markdown, then posting it without losing structure to shell-escape hell. The alternative is the web UI’s textarea, which means leaving your terminal, switching to a browser, retyping context, and losing the ability to compose the body programmatically. Bram writes the body to `/tmp/foo.md`, then `gh issue create –body-file /tmp/foo.md` — bodies stay byte-perfect, and the same pattern composes with templates and generated content.

Fluent use of GitHub issues opens up a rich vein to be mined, and Bram’s guidance to agents encourages them to dig into it. You can see a couple of valuable nuggets in issue 170. In that thread I invited Claude Code and Codex to review one anothers’ work, narrate testing with log evidence, cite related work, record architectural pivots, summarize closure, and point to next steps.

When you externalize parts of session logs to a shared space where people and their agents can collaborate, multiple benefits accrue. For people it provides transparency and accountability. Decisions and tactics aren’t squirreled away in dot file on a per-machine-per-user basis. They are accessible to the whole team both interactively and by means of gh APIs that were formerly daunting but now easily wielded by agents on our behalf.

For agents, GitHub is a place to record context, drawn from current work, that powerfully informs future work — again by way of gh APIs that agents easily wield. The release notes that Claude Code has been writing for Bram are a beautiful example of what is now possible. I always aspired to that kind of discipline but stumbled over mechanics. And that was in the Before Time when release cycles like these might be bi-monthly versus daily occurrences.

Here’s a more complete list of git and gh patterns mined from my session logs.

GitHub for the rest of us

A decade ago, in GitHub for the rest of us, I wrote:

The tools that enable software developers to work and the cultures that surround the use of those tools tend to find their way into the mainstream. It seems obvious, in retrospect, that email and instant messaging — both used by developers before anybody else — would have reached the masses. Those modes of communication were relevant to everyone.

It’s less obvious that Git, the tool invented to coordinate the development of the Linux kernel, and GitHub, the tool-based culture that surrounds it, will be as widely relevant. Most people don’t sling code for a living. But as the work products and processes of every profession are increasingly digitized, many of us will gravitate to tools designed to coordinate our work on shared digital artifacts. That’s why Git and GitHub are finding their way into workflows that produce artifacts other than, or in addition to, code.

I hope Bram will help fulfill that promise, and I think it could. Meanwhile it aims to help make otherwise chaotic agent-assisted coding orderly and accountable for non-coders newly empowered by agents, as well as for coders who want to wield git and GitHub more fluently.

Should you try Bram? Honestly I’m not sure. It’s only a month old, and there are only a handful of testers hammering on it, primarily me (using Bram to bootstrap itself) and Andrew Schulman who is using it to develop a tool for LLM-assisted code analysis. We are only an n of 2, but are both finding that Bram’s git/gh workflow is a powerful way to organize and advance our work. You might want to wait a week or two while we iron out some kinks. But if you do tirekick, please let us know how it goes!

Beyond The Dip

17 Mar 202618 Mar 2026 ~ Jon Udell ~ 1 Comment

I had an idea about 15 years ago that I wound up pursuing a lot longer than I should have. Near the end of that era I read an essay by Seth Godin called The Dip, about that low point when an idea you are convinced is worthy just isn’t taking hold. How do you know when to push on in order to break through, and when to fold because it’s a dead end?

In my case I wound up not having a choice. It was a weird project to be doing as a Microsoft evangelist with a vaguely-defined portfolio, things weren’t working out for anyone. I moved on and didn’t think much about it for a decade. Then someone asked if it might still be viable. I realized it had become possible to reboot the project and overcome one of the former obstacles: the need for a lot of boring, uncomplicated, but custom software.

The new version sat as a proof of concept for another year or so, then started to attract a few demand signals. Now it’s the Claude Code era and everything has come together in a hurry, meeting and even surpassing former goals.

So here I am on the other side of The Dip, facing the same question: will the idea take hold? The problem it aims to help people solve is still universally acknowledged to be unsolved, and the solution looks more plausible than ever. Of course I am not the only person spending an unhealthy amount of time directing genies to summon useful software into existence. Some are programmers who savor newfound empowerment. Others are not programmers and they savor it even more. They are systems thinkers. They know what they need and roughly how it should work, and can direct the genies to make it so. If good ideas are a dime a dozen, so now also are good executions of ideas. So I reckon it’s a level playing field where, as always, value plus luck may succeed.

If I do find myself back in The Dip again, I won’t try to push the idea farther than it wants to go. If the world needs it, and can now embrace it, I am there for that. If not, I have other irons in the fire.

Those who know me know the backstory, for those who don’t the details don’t matter. If you have been on the other side of the Dip, I’m curious: what happened?

How LLMs make Git and GitHub easier to use and learn

12 Mar 202612 Mar 2026 ~ Jon Udell ~ Leave a comment

I once wrote an article with the optimistic title GitHub for the rest of us. The idea was that everyone who works with others on collections of shared documents needs a powerful and easy way to see and manage change. The foundational tool for software version control — git — is incredibly powerful, and the GitHub interface layered on top of it does make git a lot easier to use. But my optimism was premature. There are some non-programmers who make effective use of GitHub but it’s still mostly for programmers working on code and documentation.

If you read that article you might have assumed that git and GitHub are easy for coders to use. No! These are wildly complex tools. I’ve worked with them every day for years, have barely mastered the basics, struggle mightily with advanced features, and dread making mistakes that will affect team members. Even if you can become adept at using them it will have been an arduous climb up the learning curve, and daily use imposes cognitive burden. You’d rather focus on whatever you’re actually trying to do, not how to resolve a merge conflict.

Now, suddenly, that cognitive burden has lifted. LLMs don’t just write code, they also wield tools like git and GitHub while doing so. You don’t hear about this in mainstream stories about the end of computer programming. Coders assign tasks to teams of agents, we’re told, then wait for results to appear. What then? Those results need to be integrated safely into complex systems. That’s a job for git and GitHub, and the LLM boost occurs there too.

Yesterday I was working on a project whose ingredients are spread across three different GitHub repositories. I need to keep track of multiple branches in each of my local copies, and be aware of the state of those branches in the remote copies. I know there are plenty of coders who can navigate this complexity, though they pay a cognitive task to do so. For me, until recently, the daunting logistics brought progress to a crawl.

I asked Claude to report how it has been helping me do these things, here’s part of the answer.

Branch archaeology

Tracing where work lives across branches: You had wrapping work spread across judell/wrap-component, wrap-component-2, -3, and -4. I ran git branch –contains <sha> and git log –all –oneline — <file> to determine that TextBox wrapping was on the original wrap-component branch (not -2 as you initially thought), FileInput wrapping was also on wrap-component, and Slider was on -2.

Diffing across branches: git diff judell/wrap-component-2…judell/wrap-component-4 –name-only to see 117 component files changed between branches, understanding what work had accumulated where.

Finding orphaned commits: git log –all –oneline — xmlui/src/components/TextBox/TextBox.tsx revealed the full history across all branches, showing which commits were on which branches.

Assessing divergence before porting work

Determining cherry-pick feasibility: Diffed the wrap-component branch’s TextBox against wrap-4’s version and discovered significant divergence (responsive layout changes, CSS regression fixes had landed in between), ruling out cherry-picking and leading to the decision to re-apply the wrapping pattern manually.

Checking if core infrastructure changed: Compared wrapComponent.tsx history on both branches (4 commits on wrap-4 vs 9 on the original) to understand whether the API had shifted.

One of the tools in git’s kit is called bisect. Bugs aren’t always immediately evident. What if there have been 30 new versions since the one you are certain was working properly? The bisect command enables a divide-and-conquer strategy: split at the midpoint, check if the bug still exists, then keep splitting until you find the last version that worked. The next version is the culprit.

I’ve always known about bisect and always struggled to use it well. As with all git commands the syntax is arcane and use is tricky. When I mentioned to a friend that Claude had empowered me to be a better user of git bisect he objected. “I might be old-school,” he said, “but I feel like I need to know how these things work.” I agreed! What I brought to the table was the knowledge that git bisect was the right tool for the job. Claude Code brought the ability to wield the tool effectively. And as it did so, I watched and learned. This aspect of LLM use is not a black box. When agents run commands on your behalf you can see and approve them.

“I should probably take an online course,” my friend said, “or watch some videos.” You can, I said, but there’s no better learning experience than to be guided through the use of a tool in a situation where you need it to solve a problem in the work you’re actually doing.

One my first posts at the dawn of the LLM era was entitled Radical just-in-time learning. In Using AI Effectively As A Student, Carson Gross (yes, that’s the HTMX guy) implores his students to use LLMs properly. I’ll paraphrase:

You are playing with fire, you can use these things in a ways that help or harm your intellectual development, I can’t choose for you, be aware.

It won’t be an easy choice, and concerns about de-skilling are real and valid. (From today’s NYT story: “If you don’t use it, you lose it.”) But nothing requires us to cede autonomy to our freakishly talented LLM assistants. We direct their efforts, and they learn from us. As we do the work they wield tools on our behalf. We can, if we choose, learn from them how best to use those tools, even as we often delegate the use to them.

AI-assisted code refactoring

12 Jan 2026 ~ Jon Udell ~ Leave a comment

Tools built to generate vast amounts of code can, paradoxically, help us write less of it: How To Use LLMs for Continuous, Creative Code Refactoring

LLM series at The New Stack

The LLM flywheel effect

10 Nov 2025 ~ Jon Udell ~ Leave a comment

How to manage a team of AI assistants in a virtuous cycle of improvement.

The LLM flywheel effect

LLM series at The New Stack

Release the Kraken!

2 Nov 20252 Nov 2025 ~ Jon Udell ~ 1 Comment

Tuscon’s Museum of Miniatures features hundreds of exhibits like this one.

“Artist Madelyn Cook spent over 3 years planning and constructing Lagniappe, which includes two separate wings and 25 individual rooms.”

People have been making these for hundreds of years, but in recent decades practioners have become more precise about measurement and scale. Many of the exhibits use a 1:12 (inch:foot) ratio.

“Cook chose to portray the estate of a fictional merchant sea captain and his family living during the American colonial period.”

The fine detail is mind boggling. See that page on the desk above? You can actually read it.

There are rooms full of these installations, many of which date from the 1980s and 1990s when an American community of practice coalesced around the style.

“4 room Rococo château, with furnishings inspired by European palaces such as those of Seville and Versailles. Designed and created by Schoenbach, of Atlanta, Georgia, over a 30-year period.”

I would guess that the whole collection representions millions of hours of effort. It’s almost overwhelming to contemplate.

This guy, Salavat Fidai, sculpts pencil tips. His medium is not quite as insane as that of Willard Wigan, whose work I saw at The Museum of Jurassic Technology. But it pushes the envelope.

As amazing as these miniatures are, I might not have made the visit just to see them. The tractor beam that pulled me in was the special exhibit of Ray Harryhausen’s orginal animatronic models and drawings. Here’s the Kraken from Clash of the Titans.

According to the Harryhausen Foundation’s podcast, he took creative liberties when bringing the legends to life. For example, this scene is a mashup of Jason and the Argonauts and the Labors of Hercules. It was actually Hercules who fought the Hydra. This bothered some classicists but Harryhausen was a pragmatist: “We have to manipulate certain aspects in order to make a movie that will flow.”

Who doesn’t love Bubo the mechanical owl?

American censors, however, did not love bare-breasted Medusa, though they were perfectly fine with her violent and bloody decapitation. Europeans, unsurprisingly, had the inverse reaction.

The skeletons from the iconic swordfighting scene were smaller than I imagined.

This model is from a film I never heard of.

The sign says:

The Story of the Tortoise and the Hare

Ray Harryhausen

c. 1952

This is the original model, rediscovered in 2008. An identical replica was made in 2002 to complete this unfinished film, 50 years later.

In 2002, Seamus Walsh and Mark Caballero of Screen Novelties, the award-winning American stop-motion animation studio, worked with Ray Harryhausen to complete his final fairy tale film, The Story of the Tortoise and the Hare, which Ray began in 1953 and never finished. Ray was delighted and grateful for their assistance and greatly admired how Mark and Seamus were able to seamlessly blend the new and original footage.

You can see the remarkable collection of miniatures anytime. But the Harryhausen exhibit, which arrived in Tuscon in September and leaves next May, is a rare U.S. appearance of artifacts that normally reside in Scotland. (Why? Ray’s wife, Diana, had very strong links to Scotland, being the great-great granddaughter of explorer David Livingstone.) So visit soon if you can!

A day in Sequoia National Park

30 Oct 202530 Oct 2025 ~ Jon Udell ~ Leave a comment

Exactly one hundred and fifty years ago John Muir walked around in the same grove of giant sequoia trees that I walked around in today, and stood next to the same two thousand ton behemoth that had been growing for two and a half millenia.

It has only been known as the General Sherman tree for a tiny fraction of its immense lifespan. I imagine it standing there blissfully unaware of its association with a cruel and destructive human being, indeed unaware of any human activity at all.

But we are making our presence known.

“Death of large sequoias (over 4 ft in diameter) in wildfires prior to 2015 was very rare”

This was my first trip to Sequoia National Park. I explored the tiny section shown on this 1927 USGS topological map.

(Wikipedia)

It’s worth clicking through to the high-res version, zooming in, and imagining what it was like to reach that place in 1875 before there were roads and cars never mind GPS-connected handheld computers.

On the Congress trail in this densest of Sequoiadendron giganteum groves, other magnificent specimens suffer comparison to notable Americans, most painfully this cluster called The House. (There’s a Senate too.)

I live among coast redwoods and was delighted to finally meet their shorter and stouter cousins. If you’ve been thinking about a visit, know that the park is open but unstaffed. I only saw one ranger and he was on latrine duty, nobody is collecting the entrance fee, yet another bit of economic fallout from the shutdown.

After walking the Congress trail I headed down to the museum (which is closed), hiked over to Moro Rock, and walked up the steps to take in the view.

(Wikipedia)

Someday I hope to ascend Half Dome using the cable hand rails but this was an easy way to enjoy the view from a big granite dome. Whitney is only a dozen miles away but “the Great Western Divide rises high enough to block it”.

My day started in Three Rivers and ended in Tehachapi after a long and rewarding detour into another section of the park.

The road up to Lake Isabella winds gradually through Sierra foothills that seemed mellower and more mesmerizing than the ones I’ve seen farther north. The road down follows the Kern River as it flows over endless pillows of granite. There’s nothing like a big dose of the majesty of California, a friend likes to say. It sure was powerful medicine today.

Reimagining car culture

25 Oct 202525 Oct 2025 ~ Jon Udell ~ Leave a comment

The Volts podcast continues to be my favorite listen. Climate change will wreak ever more havoc on the world, that’s just baked in. But the transition to clean energy is also now baked in. David Roberts delivers a steady stream of hopeful news on that front: plummeting prices for solar panels and batteries, “reconductoring” to grow the capacity of the existing grid, agrivoltaics, new geothermal techniques, and much more.

Cars are a big part of the story. Switching to EVs is great but if we only do that we are still stuck with too many large heavy vehicles that clog roads when moving, waste vast amounts of space when parked, and harm people who move through the world on foot or on bicycles. We don’t just want cleaner cars, we also want far fewer of them. This episode, with the authors of Life After Cars, explores the “tyranny of the automobile”.

American car culture always seemed wrong to me, for many reasons. On this show David Roberts crystallized one of them.

When you ride a bike through Amsterdam, you are a dozen times every minute making small adjustments to other people, and you are accommodating yourself and coordinating with other people in these micro ways over and over and over again as you ride through Amsterdam.

And it just has an effect. You realize you’re living among other people and you’re involved in a common project and you live in a common place and you’re together in the place.

I have long been fascinated by a video called A trip down Market Street. Filmed in San Francisco in 1906, shortly before the great quake, it’s a long shot that moves down Market Street toward the Ferry Building. You see a free-for-all of trolleys, pedestrians, bicycles, horsedrawn carriages, and cars. Clearly the cars are going to win but in this moment they are not yet hermetically sealed shells, they have open tops so drivers see one another and make the same kinds of micro-adjustments to cyclists and pedestrians.

In a San Franciso with fewer and more autonomous cars, can we imagine a way to recapture that kind of sociality?

Maps old and new

4 Oct 20255 Oct 2025 ~ Jon Udell ~ 4 Comments

I’m visiting with American friends who are staying in a rural farmhouse in France’s Dordogne valley. The house, which might be several hundred years old, provides faster internet access than my fiberoptic setup at home. The cars we are piloting along these ancient byways have touchscreens that control Bluetooth and satellite connections. It feels like the perfect juxtaposition of the old and the new. But the illusion cracked yesterday when we headed out to visit the medieval town of Sarlat-la-Canéda. I punched “Sarlat” into the satnav and off we went, choosing the slowest but most scenic of the offered routes. As we approached the destination my friend said: “Something is wrong, Sarlat is small but it’s not this small.” You can probably guess what happened. The maps app had found a tiny hamlet 50 miles to the north instead of the populous town 30 miles to the west. Although I know better I fell for the illusion: I’m on vacation, let the machine take care of the details, we’ll just enjoy the view. Oops.

It wasn’t really a problem. We had plenty of time, we’ve been taking back roads in order to see the countryside, we just ended up seeing more and different countryside than planned. But unlike the last time I toured France, almost 25 years ago when connected phones and map apps weren’t yet a thing, I didn’t have a conventional map and neither did my friends. Had I looked at one we would never have made this error. The map on your phone isn’t really a map, it’s a tiny viewport that can see the whole planet at any resolution but never provides the context your brain needs to reason about spatial relationships. It’ll get you from point A to point B but struggles to convey where B is in relation to C.

I’m not blaming the tech, it is a miracle I will never take for granted. The fault is entirely mine for not having a real map, spreading it out on the kitchen table before we left, enjoying a beautiful and information-dense work of cartographic art, and planning the trip with the big picture in view. That would have been another nice juxtaposition of old and new. On my next GPS-guided trip to town I’ll pick up a real map: another miracle I should never take for granted.

Update: Look what we found in a drawer. Made by Institut Géographique National in 1972.

I reckon you’d need a 16K x 12K screen to view it at print resolution.

Context engineering anchors AI agents to ground truth

8 Sep 2025 ~ Jon Udell ~ Leave a comment

Although autonomous LLMs are inherently unreliable, there’s a long software tradition of building reliable layers on top of unreliable layers. That applies here too. We can’t guarantee that you’ll never be led astray when building an XMLUI app with the help of agents that use the XMLUI MCP server to extract patterns from docs, sources, how-tos, and samples. But it’s a lot more likely now that you and your AI team will stay anchored to ground truth. At this point, I would define context engineering as whatever it takes to make that happen.

Context engineering anchors AI agents to ground truth.

LLM series at The New Stack

Introducing XMLUI

18 Jul 202524 Jul 2025 ~ Jon Udell ~ 20 Comments

In the mid-1990s you could create useful software without being an ace coder. You had Visual Basic, you had a rich ecosystem of components, you could wire them together to create apps, standing on the shoulders of the coders who built those components. If you’re younger than 45 you may not know what that was like, nor realize web components have never worked the same way. The project we’re announcing today, XMLUI, brings the VB model to the modern web and its React-based component ecosystem. XMLUI wraps React and CSS and provides a suite of components that you compose with XML markup. Here’s a little app to check the status of London tube lines.

<App>
  <Select id="lines" initialValue="bakerloo">
    <Items data="https://api.tfl.gov.uk/line/mode/tube/status">
    </Items>
  </Select>
  <DataSource
    id="tubeStations"
    url="https://api.tfl.gov.uk/Line/{lines.value}/Route/Sequence/inbound"
    resultSelector="stations"/>
  <Table data="{tubeStations}" height="280px">
    <Column bindTo="name" />
    <Column bindTo="modes" />
  </Table>
</App>

A dozen lines of XML is enough to:

Define a Select and fill its Items with data from an API call.
Define a DataSource to fetch data from another API call.
Use the value of the Select to dynamically form the URL of the DataSource.
Use a resultSelector to drill into the result of the second API call.
Bind that result to a Table.
Bind fields in the result to Columns.

This is a clean, modern, component-based app that’s reactive and themed without requiring any knowledge of React or CSS. That’s powerful leverage. And it’s code you can read and maintain, no matter if it was you or an LLM assistant who wrote it. I’m consulting for the project so you should judge for yourself, but to me this feels like an alternative to the JavaScript industrial complex that ticks all the right boxes.

Components

My most-cited BYTE article was a 1994 cover story called Componentware. Many of us had assumed that the engine of widespread software reuse would be libraries of low-level objects linked into programs written by skilled coders. What actually gained traction were components built by professional developers and used by business developers.

There were Visual Basic components for charting, network communication, data access, audio/video playback, and image scanning/editing. UI controls included buttons, dialog boxes, sliders, grids for displaying and editing tabular data, text editors, tree and list and tab views. People used these controls to build point-of-sale systems, scheduling and project management tools, systems for medical and legal practice management, sales and inventory reporting, and much more.

That ecosystem of component producers and consumers didn’t carry forward to the web. I’m a fan of web components but it’s the React flavor that dominate and they are not accessible to the kind of developer who could productively use Visual Basic components back in the day. You have to be a skilled coder not only to create a React component but also to use one. XMLUI wraps React components so solution builders can use them.

User-defined components

XMLUI provides a deep catalog of components including all the interactive ones you’d expect as well as behind-the-scenes ones like DataSource, APICall, and Queue. You can easily define your own components that interop with the native set and with one another. Here’s the markup for a TubeStops component.

<Component name="TubeStops">
  <DataSource
    id="stops"
    url="https://api.tfl.gov.uk/Line/{$props.line}/StopPoints"
    transformResult="{window.transformStops}"
  />
  <Text variant="strong">{$props.line}</Text>
  <Table data="{stops}">
    <Column width="3*" bindTo="name" />
    <Column bindTo="zone" />
    <Column bindTo="wifi" >
      <Fragment when="{$item.wifi === 'yes'}">
        <Icon name="checkmark"/>
      </Fragment>
    </Column>
    <Column bindTo="toilets" >
      <Fragment when="{$item.toilets === 'yes'}">
        <Icon name="checkmark"/>
      </Fragment>
    </Column>
  </Table>
</Component>

Here’s markup that uses the component twice in a side-by-side layout.

  <HStack>
    <Stack width="50%">
      <TubeStops line="victoria" />
    </Stack>
    <Stack width="50%">
      <TubeStops line="waterloo-city" />
    </Stack>
  </HStack>

It’s easy to read and maintain short snippets of XMLUI markup. When the markup grows to a hundred lines or more, not so much. But I never need to look at that much code; when components grow too large I refactor them. In any programming environment that maneuver entails overhead: you have to create and name files, identify which things to pass as properties from one place, and unpack them in another. But the rising LLM tide lifts all boats. Because I can delegate the refactoring to my team of AI assistants I’m able to do it fluidly and continuously. LLMs don’t “know” about XMLUI out of the box but they do know about XML, and with the help of MCP (see below) they can “know” a lot about XMLUI specifically.

Reactivity

If you’ve never been a React programmer, as I have not, the biggest challenge with XMLUI-style reactivity isn’t what you need to learn but rather what you need to unlearn. Let’s take another look at the code for the app shown at the top of this post.

<App>
  <Select id="lines" initialValue="bakerloo">
    <Items data="https://api.tfl.gov.uk/line/mode/tube/status">
        <Option value="{$item.id}" label="{$item.name}" />
    </Items>
  </Select>
  <DataSource
    id="tubeStations"
    url="https://api.tfl.gov.uk/Line/{lines.value}/Route/Sequence/inbound"
    resultSelector="stations"/>
  <Table data="{tubeStations}" height="280px">
    <Column bindTo="name" />
    <Column bindTo="modes" />
  </Table>
</App>

Note how the Select declares the property id="lines". That makes lines a reactive variable.

Now look at the url property of the DataSource. It embeds a reference to lines.value. Changing the selection changes lines.value. The DataSource reacts by fetching a new batch of details. Likewise the Table‘s data property refers to tubeStations (the DataSource) so it automatically displays the new data.

There’s a name for this pattern: reactive data binding. It’s what spreadsheets do when a change in one cell propagates to others that refer to it. And it’s what React enables for web apps. React is a complex beast that only expert programmers can tame. Fortunately the expert programmers who build XMLUI have done that for you. As an XMLUI developer you may need to unlearn imperative habits in order to go with the declarative flow. It’s a different mindset but if you keep the spreadsheet analogy in mind you’ll soon get the hang of it. Along the way you’ll likely discover happy surprises. For example, here’s the search feature in our demo app, XMLUI Invoice.

Initially I wrote it in a conventional way, with a search button. Then I realized there was no need for a button. The DataSource URL that drives the query can react to keystrokes in the TextBox, and the Table can in turn react when the DataSource refreshes.

<Component name="SearchEverything">
    <VStack paddingTop="$space-4">
        <TextBox
            placeholder="Enter search term..."
            width="25rem"
            id="searchTerm"
        />
        <Card when="{searchTerm.value}">
            <DataSource
              id="search"
              url="/api/search/{searchTerm.value}"
            />
            <Text>Found {search.value ? search.value.length : 0} results for
                "{searchTerm.value}":</Text>
            <Table data="{search}">
                <Column  bindTo="table_name" header="Type" width="100px" />
                <Column  bindTo="title" header="Title" width="*" />
                <Column  bindTo="snippet" header="Match Details" width="3*" />
            </Table>
        </Card>
    </VStack>
</Component>

Themes

When the team first showed me the XMLUI theme system I wasn’t too excited. I am not a designer so I appreciate a nice default theme that doesn’t require me to make color choices I’m not qualified to make. The ability to switch themes has never felt that important to me, and I’ve never quite understood why developer are so obsessed with dark mode. I have wrestled with CSS, though, to achieve both style and layout effects, and the results have not been impressive. XMLUI aims to make everything you build look good, and behave gracefully, without requiring you to write any CSS or CSS-like style and layout directives.

You can apply inline styles but for the most part you won’t need them and shouldn’t use them. For me this was another unlearning exercise. I know enough CSS to be dangerous and in the early going I abused inline styles. That was partly my fault and partly because LLMs think inline styles are catnip and will abuse them on your behalf. If you look at the code snippets here, though, you’ll see almost no explicit style or layout directives. Each component provides an extensive sets of theme variables that influence its text color and font, background color, margins, borders, paddings, and more. They follow a naming convention that enables a setting to control appearance globally or in progressively more granular ways. For example, here are the variables that can control the border color of a solid button using the primary color when the mouse hovers over it.

color-primary
backgroundColor-Button
backgroundColor-Button-solid
backgroundColor-Button-primary
backgroundColor-Button-primary-solid
backgroundColor-Button-primary-solid--hover

When it renders a button, XMLUI works up the chain from the most specific setting to the most general. This arrangement gives designers many degrees of freedom to craft exquisitely detailed themes. But almost all the settings are optional, and those that are defined by default use logical names instead of hardcoded values. So, for example, the default setting for backgroundColor-Button-primary is $color-primary-500. That’s the midpoint in a range of colors that play a primary role in the UI. There’s a set of such semantic roles, each associated with a color palette. The key roles are:

Surface: creates neutral backgrounds and containers.

Primary: draws attention to important elements and actions.

Secondary: provides visual support without competing with primary elements.

What’s more, you can generate complete palettes from single midpoint value for each.

name: Earthtone
id: earthtone
themeVars:
  color-primary: "hsl(30, 50%, 30%)"
  color-secondary: "hsl(120, 40%, 25%)"
  color-surface: "hsl(39, 43%, 97%)"

Themes aren’t just about colors, though. XMLUI components work hard to provide default layout settings that yield good spacing, padding, and margins both within individual components and across a canvas that composes sets of them. I am, again, not a designer, so not really qualified to make a professional judgement about how it all works. But the effects I can achieve look pretty good to me.

Scripting

As a Visual Basic developer you weren’t expected to be an ace coder but were expected to be able to handle a bit of scripting. It’s the same with XMLUI. The language is JavaScript and you can go a long way with tiny snippets like this one in TubeStops.

<Fragment when="{$item.wifi === 'yes'}"></Fragment>

TubeStops does also use the transformResult property of its DataSource to invoke a more ambitious chunk of code.

function transformStops(stops) {
  return stops.map(stop => {
    // Helper to extract a value from additionalProperties by key
    const getProp = (key) => {
      const prop = stop.additionalProperties && stop.additionalProperties.find(p => p.key === key);
      return prop ? prop.value : '';
    };
    return {
      name: stop.commonName,
      zone: getProp('Zone'),
      wifi: getProp('WiFi'),
      toilets: getProp('Toilets'),
      // A comma-separated list of line names that serve this stop
      lines: stop.lines ? stop.lines.map(line => line.name).join(', ') : ''
    };
  });
}

This is not trivial, but it’s not rocket science either. And of course you don’t need to write stuff like this nowadays, you can have an LLM assistant do it for you. So we can’t claim that XMLUI is 100% declarative. But I think it’s fair to say that the imperative parts are well-scoped and accessible to a solution builder who doesn’t know, or want to know, anything about the JavaScript industrial complex.

Model Context Protocol

In the age of AI, who needs XMLUI when you can just have LLMs write React apps for you? It’s a valid question and I think I have a pretty good answer. The first version of XMLUI Invoice was a React app that Claude wrote in 30 seconds. It was shockingly complete and functional. But I wasn’t an equal partner in the process. I’m aware that React has things like useEffect and useContext but I don’t really know what they are or how to use them properly, and am not competent to review or maintain JavaScript code that uses these patterns. The same disadvantage applies to the CSS that Claude wrote. If you’re a happy vibe coder who never expects to look at or work with the code that LLMs generate, then maybe XMLUI isn’t for you.

If you need to be able review and maintain your app, though, XMLUI levels the playing field. I can read, evaluate, and competently adjust the XMLUI code that LLMs write. In a recent talk Andrej Karpathy argues that the sweet spot for LLMS is a collaborative partnership in which we can dynamically adjust how much control we give them. The “autonomy slider” he envisions requires that we and our assistants operate in the same conceptual/semantic space. That isn’t true for me, nor for the developers XMLUI aims to empower, if the space is React+CSS. It can be true if the space is XMLUI.

To enhance the collaboration we provide an MCP server that helps you direct agents’ attention as you work with them on XMLUI apps. In MCP is RSS for AI I described the kinds of questions that agents like Claude and Cursor can use xmlui-mcp to ask and answer:

Is there a component that does [X]?

What do the docs for [X] say about topic [Y]?

How does the source code implement [X]?

How is [X] is used in other apps?

You place the xmlui-mcp server alongside the xmlui repo which includes docs and source code. And the repo in which you are developing an XMLUI app. And, ideally, other repos that contain reference apps like XMLUI Invoice.

Working with LLMs

This arrangement has mostly exceeded my expectations. As I build out a suite of apps that exemplify best practices and patterns, the agentic collaboration improves. This flywheel effect is, of course, still subject to the peculiar habits of LLM assistants who constantly need to be reminded of the rules.

1 don’t write any code without my permission, always preview proposed changes, discuss, and only proceed with approval.

2 don’t add any xmlui styling, let the theme and layout engine do its job

3 proceed in small increments, write the absolute minimum amount of xmlui markup necessary and no script if possible

4 do not invent any xmlui syntax. only use constructs for which you can find examples in the docs and sample apps. cite your sources.

5 never touch the dom. we only use xmlui abstractions inside the App realm, with help from vars and functions defined on the window variable in index.html

6 keep complex functions and expressions out of xmlui, they can live in index.html or (if scoping requires) in code-behind

7 use the xmlui mcp server to list and show component docs but also search xmlui source, docs, and examples

8 always do the simplest thing possible

It’s like working with 2-year-old savants. Crazy, but it can be effective!

To increase the odds that you’ll collaborate effectively, we added a How To section to the docs site. The MCP server makes these articles visible to agents by providing tools that list and search them. This was inspired by a friend who asked: “For a Select, suppose you don’t have a static default first item but you want to fetch data and choose the first item from data as the default selected, how’d you do that in xmlui?” It took me a few minutes to put together an example. Then I realized that’s the kind of question LLMs should be able to ask and answer autonomously. When an agent uses one of these tools it is anchored to ground truth: an article found this way has a citable URL that points to a working example.

It’s way easier for me to do things with XMLUI than with React and CSS, but I’ve also climbed a learning curve and absorbed a lot of tacit knowledge. Will the LLM-friendly documentation flatten the learning curve for newcomers and their AI assistants? I’m eager to find out.

Content management

We say XMLUI is for building apps, but what are apps really? Nowadays websites are often apps too, built on frameworks like Vercel’s Next.js. I’ve used publishing systems built that way and I am not a fan. You shouldn’t need a React-savvy front-end developer to help you make routine changes to your site. And with XMLUI you don’t. Our demo site, docs site, and landing page are all XMLUI apps that are much easier for me to write and maintain than the Next.js sites I’ve worked on.

“Eating the dogfood” is an ugly name for a beautiful idea: Builders should use and depend on the things they build. We do, but there’s more to the story of XMLUI as a CMS. When you build an app with XMLUI you are going to want to document it. There’s a nice synergy available: the app and its documentation can be made of the same stuff. You can even showcase live demos of your app in your docs as we do in component documentation, tutorials, and How To articles.

I was an early proponent of screencasts for software demos, and it can certainly be better to show than tell, but it’s infuriating to search for the way to do something and find only a video. Ideally you show and tell. Documenting software with a mix of code, narrative, and live interaction brings all the modalities together.

Extensibility

Out of the box, XMLUI wraps a bunch of React components. What happens when the one you need isn’t included? This isn’t my first rodeo. In a previous effort I leaned heavily on LLMs to dig through layers of React code but was still unable to achieve the wrapping I was aiming for.

For XMLUI the component I most wanted to include was the Tiptap editor which is itself a wrapper around the foundational ProseMirror toolkit. Accomplishing that was a stretch goal that I honestly didn’t expect to achieve before release. But I was pleasantly surprised, and here is the proof.

This XMLUI TableEditor is the subject of our guide for developers who want to understand how to create an XMLUI component that wraps a React component. And isn’t just a toy example. When you use XMLUI for publishing, the foundation is Markdown which is wonderful for writing and editing headings, paragraphs, lists, and code blocks, but awful for writing and editing tables. In that situation I always resort to a visual editor to produce Markdown table syntax. Now I have that visual editor as an XMLUI component that I can embed anywhere.

The React idioms that appear in that guide were produced by LLMs, not by me, and I can’t fully explain how they work, but I am now confident it will be straightforward for React-savvy developers to extend XMLUI. What’s more, I can now see the boundary between component builders and solution builders begin to blur. I am mainly a solution builder who has always depended on component builders to accomplish anything useful at that level. The fact that I was able to accomplish this useful thing myself feels significant.

Deployment

Here’s the minimal XMLUI deployment footprint for the TableEditor.

TableEditor
├── Main.xmlui
├── index.html
└── xmlui
    └── 0.9.67.js

The index.html just sources the latest standalone build of XMLUI.

<script src="xmlui/0.9.67.js"></script>

Here’s Main.xmlui.

<App var.markdown="">
  <Card>
    <TableEditor
      id="tableEditor"
      size="xs"
      onDidChange="{(e) => { markdown = e.markdown }}"
    />
  </Card>
<Card>
  <HStack>
    <Text variant="codefence" preserveLinebreaks="{true}">
      { markdown }
    </Text>
    <SpaceFiller />
    <Button
      icon="copy"
      variant="ghost"
      size="xs"
      onClick="navigator.clipboard.writeText(markdown)"
    />
  </HStack>
</Card>
</App>

You can use any static webserver to host the app. You can even run it from an AWS bucket.

For XMLUI Invoice we provide a test server that includes a localhost-only static server, embeds sqlite, and adds a CORS proxy for apps that need that support when talking to APIs (like Hubspot’s) that require CORS. You may need to wrap similar capabilities around your XMLUI apps but the minimal deployment is dead simple.

Web development for the rest of us

XMLUI was conceived by Gent Hito who founded /n software and CData. The mission of /n software: make network communication easy for developers. For CData: make data access easy for developers. And now for XMLUI: make UI easy for developers.

“We are backend people,” Gent says. “All our components are invisible, and when we tried to build simple business UIs we were surprised to find how hard and frustrating that was.”

Those of us who remember the Visual Basic era know it wasn’t always that way. But the web platform has never been friendly to solution builders who need to create user interfaces. That’s become a game for specialists who can wrap their heads around an ongoing explosion of complexity.

It shouldn’t be that way. Some apps do require special expertise. But many shouldn’t. If you are /n software, and you need to give your customers an interface to monitor and control the CoreSSH Server, you shouldn’t need to hire React and CSS pros to make that happen. Your team should be able to do it for themselves and now they can.

I’m having a blast creating interfaces that would otherwise be out of my reach. Will you have the same experience? Give it a try and let us know how it goes!

MCP is RSS for AI

28 May 2025 ~ Jon Udell ~ 2 Comments

We mostly don’t want to read the docs, but we do want to converse with them. When we build search interfaces for our docs, we have always tried to anticipate search intentions. People aren’t just looking for words; they need to use the material to solve problems and get things done. When you create an MCP server, you are forced to make those search intentions explicit. That will be as useful for us as it is for the robots, and will help us work with them more effectively.

MCP Is RSS for AI

LLM series at The New Stack

The Musk Massacre

7 May 20257 May 2025 ~ Jon Udell ~ Leave a comment

The great adventure of my birth family was the fifteen months we lived in New Delhi, from June of 1961, on a USAID-sponsored educational mission. So the destruction of USAID feels personal. I’m only now realizing that we were there at the very beginning of USAID, during what Jackie Kennedy later mythologized as the Camelot era. On a tour of India, at a meet-and-greet in New Delhi, she appears in this family photo.

We must have been at the embassy, she’s surrounded by Americans. You can see a few South Asian faces in the background. The young boy at the center of the photo, gazing up at the queen of Camelot, is five-year-old me.

It could have been a Life Magazine cover: “A vision in white, Jackie represents America’s commitment to be of service to the world.” As corny as that sounds, though, the commitment was real. Our nation upheld it for sixty years and then, a few months ago, fed it to the wood chipper and set in motion a Holocaust-scale massacre.

We suggest the number of lives saved per year may range between 2.3 to 5.6 million with our preferred number resting on gross estimates of 3.3 million.

The shutdown likely won’t kill 3.3 million people annually, say its “only” a million. Per year. For six years. It adds up.

Atul Gawande was leader of global public health for USAID. On a recent podcast he runs some more numbers.

On USAID “waste”:

“It’s 0.35% of the federal budget, but that doesn’t help you, right? Try this. The average American paid $14,600 in taxes in 2024. The amount that went to USAID is under $50. For that we got control of an HIV epidemic that is at minuscule levels compared to what it was before. We had control of measles and TB. And it goes beyond public health. You also have agricultural programs that helped move India from being chronically food-aid-dependent to being an agricultural exporter. Many of our top trading partners once received USAID assistance that helped them achieve economic development.”

On USAID “fraud”:

“When Russia invaded Ukraine they cut off its access to medicine, bombed the factories that made oxygen, ran cyberattacks. The global health team moved the entire country’s electronic health record system to the cloud, and got a supply chain up and running for every HIV and TB patient in the country.”

On USAID “abuse”:

“The countries where we worked had at least 1.2 million lives saved. In addition, there was a vaccine campaign for measles and for HPV. For every 70 girls in low income countries who are vaccinated against cervical cancer from HPV, one life is saved. It’s one of the most life-saving things in our portfolio. Our vaccine programs would have saved an additional 8 million lives over the next five years.”

America has never been a shining city on the hill but USAID represented our best aspirations. In the throes of the Maoist cultural revolution that tore it down there are many other horrors to confront, but for me this one hits hardest.

Who will take care of you in your time of need?

12 Apr 202512 Apr 2025 ~ Jon Udell ~ 3 Comments

This Fresh Air interview with Hanif Kureishi had me riveted from the beginning, for one reason, and then at the end for a different reason. Kureishi is best known as the author of the 1985 British rom-com My Beautiful Laundrette. During an illness in 2022 he fainted, fell on his face, broke his neck, and woke up paraplegic. His account of what that’s like resonated deeply.

Soon after we moved to Santa Rosa a decade ago I became close friends with someone who had suffered the same fate. Until the age of 30 Stan Gow was a rodeo rider, mountain climber, and ski patrol hotshot.

Then he dove into a shallow pool, broke his neck, and spent the next 40 years in a motorized wheelchair.

Before an accident like that you’re an autonomous person, then suddenly and forever after you’re as helpless as an infant, wholly dependent on others who feed you, clean you, dress you, hoist you into the chair in the morning, put you to bed at night, and turn you over in bed during the night.

“You feel like a helpless baby,” Kureishi says, “and a tyrant too.” I saw this happen with Stan. When you have to ask caregivers for everything it feels shameful and embarrassing. Those feelings can convert polite requests into angry demands.

The only escape from that condition, for those lucky enough to be able to own and use one, is the motorized wheelchair. Kureishi has just enough use of an arm to be able to drive himself around the neighborhood. Stan did too, and over the years we walked just about everywhere his wheels could go. Tagging along I gained a deep appreciation for that miracle of mobility, and for the consequences when it’s thwarted by stairs that lack ramps and curbs that lack cuts.

The interview brought back powerful memories of my time with Stan, who died a few years ago after outliving expectations for an injury like his by decades. And then it took a turn when Terri Gross asked about the ethnicity of Kureishi’s caregivers. He was in Italy when the accident happened, and nearly everyone in the hospital was white. When he returned to England it was a different story.

The whole of our huge NHS is run by people from all over the world, and it’s just incredible to lie in bed to be changed and washed by someone and you have these incredible conversations with somebody from Africa, from the Philippines, from India or Pakistan. One of the things you become aware of in these British hospitals is our dependence on immigration.

It’s not quite like that in the US, but much more so than in Italy. During my mother’s final illness one of her caretakers was a Haitian nurse. Mom was a linguist who spoke and taught French, Spanish, and Italian. She’d been unresponsive for a few days, but when the nurse spoke to her in French she perked up like one of the patients in Awakenings.

Paraplegia is rare but helplessness is universal. We all begin that way, we all end that way. Demonizing immigrants is wrong for so many reasons. Among them: who else will take care of you in your time of ultimate need?

Making the Fediverse More Accessible With Claude 3.7 Sonnet

7 Mar 20258 Mar 2025 ~ Jon Udell ~ Leave a comment

A few years ago I abandoned Twitter in favor of Mastodon. Recent events validate that choice and underscore the strategic importance of a decentralized fediverse that can’t be owned by a single corporate or state actor. But while Mastodon meets my needs, much of the Twitter diaspora has gone to Bluesky. That’s fine for now but might not always be. In an article titled “Science Must Step Away From Nationally Managed Infrastructure,” Dan Goodman writes:

Many scientists put huge efforts into building networks to communicate with colleagues and the general public. But all that work and the value in those networks was lost when many scientists felt compelled to leave following Elon Musk’s takeover of the platform (now X). The process of rebuilding on Bluesky is underway, but it will take years and may never reach the same critical mass. Even if the transition is successful, the same thing may happen to Bluesky in a few years.

How can we prepare for a future migration from Bluesky to Mastodon? Bridgy Fed — a service that enables you to connect together your website, fediverse account and Bluesky account — will help. But Bridgy Fed needs to be easier to use. So I recruited Claude’s new Sonnet 7 model to do that.

Making the Fediverse More Accessible With Claude 3.7 Sonnet

LLM series at The New Stack

Web Components

12 Feb 202512 Feb 2025 ~ Jon Udell ~ Leave a comment

The JavaScript industrial complex won’t crumble anytime soon. But the stage is set for a return to an ecosystem of reusable components accessible to business developers, only this time based on the universal web platform and its core standards.

How To Build Web Components Using ChatGPT

LLM series at The New Stack

The Configuration Crisis

14 Jan 202514 Jan 2025 ~ Jon Udell ~ Leave a comment

Perhaps, even though they are not themselves explainable, AIs can help us engineer explainable systems. But I’m not optimistic. It feels like we’re on a path to keep making systems harder for humans to configure, and we keep expanding our reliance on superhuman intelligence to do that for us.

The Configuration Crisis and Developer Dependency on AI

LLM series at The New Stack

The social cost of mediated experience

24 Nov 202424 Nov 2024 ~ Jon Udell ~ 2 Comments

The first time I heard a critique of mediated experience, the critic was my dad. He was an avid photographer who, during our family’s year in India, when I was a young child, used his 35mm Exacta to capture thousands of photos that became carousels of color slides we viewed for many years thereafter. It was a remarkable documentary effort that solidified our memories of that year. But dad was aware of the tradeoff. A favorite joke became: “Q: How was your trip?” “A: I won’t know until the film is developed!” He realized that interposing a camera between himself and the people he encountered had altered the direct experience he and they would otherwise have had.

This weekend I heard Christine Rosen’s modern version of that critique in a discussion of her new book The extinction of experience: Being human in a disembodied world. I listened to the podcast on a hike, my noise-canceling Airpods insulating me from the sounds of the creek trail and from the people walking along it.

It’s complicated. When hiking alone I greatly value the ability to listen to interesting people and ideas while exercising, breathing fresh air, and moving through the natural world. The experience is embodied in one sense, disembodied in another. Reading the same material while lying on the couch would be a different, and arguably more extreme, form of disembodiment. But when I passed a family of four, all walking along looking at their phones, that felt wrong. When people are together they should actually be together, right? You’ve doubtless felt the same when seeing people in this together-but-not-together state.

Lately Pete Buttigieg has been urging us to spend less time online, more time IRL having face-to-face conversations. I think that’s right. There’s no doubt that the decline of social capital described in Robert Putnam’s Bowling Alone has accelerated in the 30 years since he wrote that book. America’s tragic polarization is a predictable outcome. Without the institutions and cultural traditions that once brought us together, face-to-face, in non-political ways, we’re all too vulnerable to being herded into competing online echo chambers that magnify our differences and erase our common humanity.

I won’t be abandoning my mediated and disembodied life online, but I do need to participate in it less and more critically, and prioritize my unmediated and embodied life IRL. The pendulum has swung too far away from the direct experience of shared reality, and that hasn’t been good for me nor for my country,

How To Create Software Diagrams With ChatGPT and Claude

2 Nov 20242 Nov 2024 ~ Jon Udell ~ Leave a comment

Earlier efforts to diagram software with LLM assistance weren’t fruitful, but this time around things went really well. I ended up with exactly what I needed to explain the architecture of a browser extension, and along the way I learned a lot about a couple of formats — Mermaid and Graphviz — as well as their tool ecosystems.

How To Create Software Diagrams With ChatGPT and Claude

LLM series at The New Stack

What Claude and ChatGPT can see on your screen

25 Oct 2024 ~ Jon Udell ~ Leave a comment

“If you work with these cloud platforms every day, you have doubtless forgotten that you ever had questions like these. But every newcomer does. And on a continuing basis, we are all newcomers to various aspects of applications and services. In so many ways, the experience boils down to: I am here, what do I do now?

It’s nice if you can share your screen with someone who has walked that path before you, but that’s often impossible or infeasible. LLMs synthesize what others have learned walking the path. We typically use words to search that body of hard-won knowledge. Searching with images can be a powerful complementary mode.”

What ChatGPT and Claude can see on your screen

Part of the LLM series at The New Stack.

Mix Human Expertise With LLM Assistance for Easier Coding

10 Oct 2024 ~ Jon Udell ~ Leave a comment

There are plenty of ways to use LLMs ineffectively. For best results, lean into your own intelligence, experience, and creativity. Delegate the boring and routine stuff to closely supervised assistants whose work you can easily check.

Mix Human Expertise With LLM Assistance for Easier Coding

Part of the LLM series at The New Stack.

Geothermal power in the North Bay

5 Oct 20245 Oct 2024 ~ Jon Udell ~ 3 Comments

I was aware of The Geysers, a geothermal field about 35 miles north of my home in Santa Rosa, but I never gave it much thought until my first bike ride through the area. Then I learned a number of interesting things.

It’s the world’s largest geothermal field, producing more than 700 megawatts.

It accounts for 20% of California’s renewable energy.

The naturally-occurring steam was used up almost 30 years ago, and steam is now recharged by pumping in 11 million gallons of sewage effluent daily, through a 42-mile pipeline, from the Santa Rosa plain.

That daily recharge is implicated in the region’s frequent small earthquakes. (But nobody seems too worried about that, and maybe it’s a good thing? Many small better than one big?)

An article in today’s paper reports that AB-1359, signed last week by governor Gavin Newsom, paves the way for new geothermal development in the region that could add 600 megawatts of geothermal production.

How much electric power is that? I like to use WolframAlpha for quick and rough comparisons.

So, 2/3 of a nuke plant. 4/5 of a coal-fired power plant. These kinds of comparisons help me contextualize so many quantitative aspects of our lives. They’re the primary reason I visit WolframAlpha. I wish journalists would use it for that purpose.

Making a Vote Forward checklist

30 Sep 202430 Sep 2024 ~ Jon Udell ~ 1 Comment

In How and why to write letters to voters I discussed Vote Forward, my favorite way for those of us who aren’t in swing states to reach out to voters in swing states. The site works really well for adopting batches of voters, and downloading packets of form letters. As I close in on 1000 letters, though, I’m finding it isn’t great for tracking progress at scale. Here’s how my dashboard page looks.

With 50 bundles in play, many of which are farmed out to friends and neighbors who are helping with the project, it’s become cumbersome to keep track of which bundles are prepped (ready to mail) or not. Here is the checklist I needed to see.

VoteForward Dashboard Report

mmorg: 1-UNPREPPED
r23Pp: 2-UNPREPPED
v9Kbo: 3-UNPREPPED
wLMPw: 4-UNPREPPED
24L4o: 5-PREPPED
4nNnj: 6-PREPPED
5rQmV: 7-PREPPED
...
YV4dL: 48-PREPPED
zKjne: 49-PREPPED
ZrKJz: 50-PREPPED

If you’re in the same boat, here’s a piece of code you can use to make your own checklist. It’s gnarly, if you aren’t a programmer I advise you not even to look at it, just copy it, and then paste it into your browser to have it open a new window with your report.

Vote Forward checklist maker (expand to copy)

javascript:(function(){
  // First part: Adjust height of divs with inline styles
  document.querySelectorAll('div[style]').forEach(div => {
    let inlineStyle = div.getAttribute('style');
    if (inlineStyle.includes('position: relative')) {
      div.style.height = '20000px';  // Set the height to 20000px
    }
  });

  // Introduce a delay before processing the list of items
  setTimeout(() => {
    const items = document.querySelectorAll('li.bundle-list-item.individual');

    let dataList = [];

    // Iterate over the items to capture data-testid and ID
    items.forEach(item => {
        let dataTestId = item.getAttribute('data-testid');
        
        // Use the id attribute of the input element to extract the ID
        const toggleInput = item.querySelector('input.slide-out-toggle');
        const toggleId = toggleInput ? toggleInput.getAttribute('id') : '';
        
        // Extract the ID part from the toggleId pattern "toggle-24L4o-PREPPED"
        const id = toggleId ? toggleId.split('-')[1] : 'ID not found';

        // Remove "bundle-" and the number part from dataTestId, keeping only "PREPPED" or "UNPREPPED"
        dataTestId = dataTestId.split('-').pop();  // Extract only the "PREPPED" or "UNPREPPED" part

        // Push the data into the array
        dataList.push({ dataTestId, id });
    });

    // Sort first by whether it's PREPPED or UNPREPPED (descending for UNPREPPED first), 
    // then by the ID within each group
    dataList.sort((a, b) => {
        if (a.dataTestId.includes("PREPPED") && b.dataTestId.includes("UNPREPPED")) {
            return 1;  // UNPREPPED comes before PREPPED
        } else if (a.dataTestId.includes("UNPREPPED") && b.dataTestId.includes("PREPPED")) {
            return -1;
        }
        // Sort by ID if they belong to the same category
        return a.id.localeCompare(b.id);
    });

    // Prepare the output string
    let output = '';
    dataList.forEach((item, index) => {
        output += `${item.id}: ${index + 1}-${item.dataTestId}\n`;
    });

    // Open a new window with the output in a text area for easy copying
    let newWindow = window.open('', '', 'width=500,height=500');
    newWindow.document.write('<html><body><h2>VoteForward Dashboard Report</h2><pre>' + output + '</pre></body></html>');
    newWindow.document.close();
  }, 2000);  // Adjust delay as needed
})();

Here are instructions for Chrome/Edge, Safari, and Firefox. You might need to tell your browser to allow the popup window in which it writes the report.

Chrome/Edge:

Open the VoteForward dashboard in your browser.
Open the developer console:
- Windows/Linux: Press Ctrl + Shift + J.
- Mac: Press Cmd + Option + J.
Paste the code into the console.
Press Enter to run the code.

Firefox:

Open the VoteForward dashboard in your browser.
Open the developer console:
- Windows/Linux: Press Ctrl + Shift + K.
- Mac: Press Cmd + Option + K.
Paste the code into the console.
Press Enter to run the code.

Safari:

Open the VoteForward dashboard in your browser.
Enable the developer console (if it’s not already enabled):
- Go to Safari > Preferences.
- Click the Advanced tab.
- Check “Show Develop menu in menu bar” at the bottom.
Open the developer console:
- Press Cmd + Option + C.
Paste the code into the console.
Press Enter to run the code.

It would be nice to have this as a built-in feature of the site but, as we come down to the wire, this may be a helpful workaround.

Thanks, again, to the Vote Forward team for all you do! It’s a great way to encourage voter turnout.

deo absente deum culpa

20 Sep 202420 Sep 2024 ~ Jon Udell ~ 4 Comments

On a recent trip I saw this pair of Latin phrases tattooed on the back of a flight attendant’s arms:

Left: Deo absente. Right: Deum culpa.

I took Latin in middle school, and could guess what the combination might mean. It’s not a common construction, and a search seems to confirm my guess. Both Google and Bing take you to a couple of Reddit posts in r/Latin.

Would this be the correct translation?

A song I like, Deus in absentia by Ghost, has that line in it intending to mean “In the absence of God”, so I was looking into alternate translations/syntax of the phrase intending to mean “In the absence of God; Blame/Fault God”. Would this make sense: “Deum in absente; Culpa Deus” or “Deus Culpa”?

Does the phrase “Deus In Absentia, Deus Culpa” make sense?

I’m using this for a tattoo and want to be absolutely sure it works in the sense of ‘In the absence of God, blame God’. All help appreciated!

Is that the same person I saw? If so, the responses in r/Latin seem to have guided them to the final text inked on their arms. And if so, the message is essentially what I had guessed. The intent of the message, though, is open to interpretation. I’m not quite sure how to take it. What do you think it means? Would it have been rude to ask?