A persistent source of confusion and misalignment in the blockchain space stems from a conflation of specification documents with standards. The tendency to think of the EIP process as a complete standards process or, worse, to think of EIP editors and Cat Herders as a decentralized “standards development organization” (SDO) does the EIP process an injustice by making it responsible for things outside its scope. The EIP editors, magicians, and Cat Herders help documents through a document process; whether each document is widely and uniformly adopted (and thus can be called a “standard”) is something they have not been (and should not be!) resourced or empowered to influence. It is our belief that adoption is healthiest and most efficient as a separate (and carefully separated) concern. It is also our belief that supporting and coordinating across different categories of standard is worth doing, and worth doing now, given the juncture at which we find ourselves.
To start with, we should recognize that the meaning of “adoption” varies widely between different kinds of standard, or in EIP terms, between different categories of EIP, particularly as many shades of EVM-compatibility are rolled out. At the level of Ethereum clients (core and networking proposals, as well as Rollup Improvement Proposals in the case of Layer-2 clients, i.e. validators), the Ethereum client working group (“AllCoreDevs”, or “ACD”) can encourage individual clients to adopt a proposal, and later schedule it onto a hardfork to make it mandatory for all clients, making “adoption” a fairly binary category. At the level of smart contracts and wallets, however, protocol changes don’t come into play; market adoption is both paramount and fuzzy, and currently somewhat ad hoc and thus opaque to boot. At the level of changes to smart contract bytecode or to specific compiled languages like Solidity and Vyper, distinct communities beyond Ethereum coordinate adoption and backwards compatibility. To some degree, wallet<>node interfaces are specified and regularized by the “execution API” and the OpenRPC project, but this is not yet an integral part of the EIP process or of standardization more generally.
As the blockchain space grows in maturity and complexity, the gaps between these various adoption communities seem to be growing. The monolithic “EIP process” is starting to adapt to these divergent constituencies, who are lobbying for greater control over the standards process as a whole, of which the specification process is just a middle step. To mitigate the risk of a “splintering” or “siloing” of those processes, this document seeks to describe a “big picture” that might be shared across all of them and to create a common language that maximizes understanding and interaction between them.
We could generalize the lifecycle of standards in evolving software markets to a 5-stage process:

1. Problem definition: articulating the problem space, use cases, and candidate solutions worth standardizing.
2. Collaborative design: multiple stakeholders chartering, scoping, and co-designing one or more common solutions.
3. Specification: drafting and refining the specification itself (the heart of today’s EIP process).
4. Review and feature freeze: hardening the finalizable document while widening the circle of implementers.
5. Adoption and feedback: implementation reports, errata, and upgrade paths feeding back into future documents.
We could say that stages 1 & 2 are currently optional; they are sometimes done in an ad hoc manner on ethereum-magicians or equivalent discussion fora online, or at public events, leading to an “initial draft” of an EIP, but while this might be considered “best practice” there is little incentive to do it so publicly. Indeed, particularly at the level of wallets and smart contracts, there are many incentives NOT to design in the open what might otherwise end up a differentiating, competitive feature. Many design discussions happen in private or company-internal channels and proceed straight to step 3 by opening a PR containing an initial draft of an EIP.
The current EIP process encompasses step 3, and optionally step 4; the “feature freeze” part is entirely up to the EIP author to propose and incentivize, and can even be skipped over since the EIP process is relatively unopinionated about the normative “content” (features and security or privacy implications thereof), by design. One could say the current process is maximally permissionless, putting a high premium on neutrality. It is worth mentioning that in more formal SDO processes like W3C, IETF, and ISO, there is a designated maximum and minimum time for this “post-feature freeze” review of implications, usually combined with structured horizontal review processes to ensure uninterested parties with security and/or privacy expertise “red team” the specification.
The fifth step is currently outside the scope of the EIP process, to such an extent that security professionals keep complaining that they don’t know how or where to publish security guidance relevant to the implementers of long-final documents like ERC-20. While the feedback loop of errata to new features is closed to some degree in the ACD categories, there is a pronounced lack of an ACD equivalent at other levels to support either function, much less coordinate them into a feedback loop. The following describes ways this could be formalized and extended to other category-bound standing working groups, or even replicated in other decentralized improvement processes.
In the first stage, the goal is to highlight the problem being solved, potential use cases, and a high-level sketch of the technical approach, defining a problem space and one or more solutions that would benefit from standardization. Input from far afield (especially non-technical and business specialists!) can be a stitch in time here!
This is really about getting the initial problem statement down on paper to determine if one solution might be worth standardizing on (rather than solving independently or competitively). In many cases, even a general category of solution might be premature, although in other cases a sketch of a solution might be needed to find the right counterparties to design it together. The equivalent stage in the IETF would be something like a rudimentary first draft published as an “internet draft” or an informational-track use-case or problem statement circulated for review (usually through the mailing list of a standing working group or even dispatch-wide). Within a less formalized context like the Chromium developer community, the equivalent would be the publication of an explainer. By having a document that’s shareable we can sanity-check the impulse to standardize in the first place, and find collaborators and potential implementers as soon as possible. An ethereum-magicians thread with a few in-depth responses might be enough to be ready to start an EIP process (if a final EIP is the only goal), and that could be called the bare minimum “stage 1” for a specification; but something a little more substantial, that captures the imagination of companies or projects rather than individual researchers and hobbyists, would be table stakes for “stage 1” of an adopted, multi-stakeholder standard at the level of wallets or smart contracts.
These “problem space”/groundwork documents will likely be published on an individual or organizational Github or collaboration tool like HackMD or Cryptpad, and they do not need any sort of formal document process in place for them (unless IP is particularly sensitive at this stage). They’re intended to be low stakes design documents that can be shared and if they’re not retained it won’t harm the development of a future standard. At this stage, there may be many competing solutions to address a problem, and that is often a good sign that there’s desire by numerous parties to standardize on this and move it forward. There might just as well be no solutions yet in prod or publicly described, but in this case, the thing to look for is multiple stakeholders (competitors, perhaps) all recognizing a shared business problem, or a business problem in common that would be better solved by a small number of common solutions rather than 100 incompatible parallel solutions.
In general, working groups at SDOs tend rightly to be wary of taking on work items that are championed by only a single party or that arrive suspiciously complete; working groups in the EIP context should similarly discourage, or at least be skeptical of, such specification projects. Perhaps, with enough momentum and institutional memory collected in the relevant working groups, authors or organizations pushing single-author specifications could be asked (or even required) to find a collaborator among the active membership or previous authors in a given category, to provide some credible proof of openness to compromise and co-design.
Once a goal has been set, collaborative design begins on one or more common solutions; multiple organizations/projects are investing significant labor in going from idea to specification, and want certain guarantees to raise the likelihood of arriving at a standard some day. Standing working groups are a sensible default here, efficiently maximizing feedback and front-loading adoption-oriented coordination.
If the main goal of stage 1 is to identify multiple stakeholders and convince them of the value of working together on a solution (ideally, towards an adoption strategy and a standard, not just a spec), the main goal of stage 2 is to convince 51% of them of a shortlist of candidate solutions at a high level, as well as of a process for agreeing to implement one of them in common. The primary output of stage 2 should be a shared vision of the scope of the work to be done, a rough timeline, and a “plan for a plan”, if not a detailed plan. Maybe that work includes user research, testing design, or other inputs; maybe that work is just a specification, or even a “retrospecification” of something already implemented in two different, mostly-compatible ways. But the to-do list is less important than the plan for crossing off all the items.
In an “ad hoc working group” (i.e., a group that works on one specification or a discrete set of interlinked specifications, but disbands after that), this could be called a “working group charter”; as a work item or formation within a standing working group, this might be more accurately called a “specification charter.” In either case, it often helps to cover things like the IP boundaries of the collaboration, resolution mechanisms/authority, ragequit rights, communications channels, disclosures/transparency expectations, etc. Defining all these explicitly can be a major lift, but a standing working group might provide pre-filled templates or sensible defaults for all these questions. These questions are important for active contributors, to help resolve conflicts that may arise after the fact; they also help non-active contributors know when they may want to provide feedback, or when they may want to consider implementing even if they aren’t active in the specification’s development.
An explicit “spec charter” document should define:

- the IP boundaries of the collaboration;
- decision-making authority and conflict-resolution mechanisms;
- “ragequit” rights;
- communications channels;
- disclosure/transparency expectations;
- success criteria for the specification.
Often, many of the above are decided implicitly or on-the-fly by the individuals involved, rather than their organizations or process-helpers. As the stakes rise over time, however, there are benefits to being more explicit about them, even if just in the form of GitHub “issue templates” or other formulaic guidance docs. Since working groups covering certain categories naturally accumulate experience with these issues and are familiar with the often unstated business incentives and market dynamics at work, it is far simpler and faster for them to suss out these risks and make these agreements more explicit internally, having already assembled the right people.
On a mechanical note, these same working groups can also centralize (and cross-pollinate) these processes by hosting repositories, document collaboration tools, and/or discussion channels for each current and former standard. It often makes sense to “fork” older or related ones to iterate on documents and artefacts originally published elsewhere (whether by another group or by some of the same participants at stage 1). Using an issue tracker in that repository is recommended to project-manage this stage, since the next stage will likely be centered on a repository somewhere else with its own issue tracker (and perhaps a very different document process and linting regime).
The goal of stage 3 shouldn’t be getting to Review status, but rather, significant adoption and consensus to make that Review phase meaningful– getting the former without the latter can add noise to the channel and erode confidence in the process.
The “charter” (and in particular, its success criteria) could be a very explicit, formal, serious document or it could be a github issue created from a template; in either case, it is best for it to “follow” a specification from its small incubating group (whether in a standing working group or not) into the more public arena of EIP hardening and review by a wider public. This allows the focus to be on the specification qua specification; if other work streams have to be parallelized (like implementation feedback, testing artefacts, sample apps to facilitate evaluation or implementation, etc), it can continue off to the sides, with or without being explicitly referenced on the editorial threads discussing the text of the specification itself.
The degree of formality of this “charter” and how much it bears on the hardening process of the text itself is definitely up for debate, and balancing flexibility against adoption-maximizing mechanisms is difficult. More formal standards bodies generally have a hard requirement of charter-approval by some kind of Technical Architecture Board or other cross-working group coordinating body. That might sound too centralized or top-down for a decentralized stack, but some middle ground might be worth reaching where “approval” by a sponsoring working group, or by one or more coordinating bodies, serves at least as a market signal, if not a hard requirement to EIP review.
If anything, the charter process should be considered orthogonal to the specification hardening process; instead, an explicit charter is an upfront investment in adoption and consensus around the specification being hardened. The value in explicitly “signing up” to a specification charter before or while collaborating on the specification is that it offers an early opportunity for concerns or objections to be expressed and addressed. One consequence of more formal and public forms of consensus-building is that conflict resolution might also benefit from a little formality; this should be monitored as working groups develop independently.
Once an EIP is in “draft” status and being edited in public, the goal of the 3rd stage could be defined as refining the specification (on the EIP side), achieving the success criteria of the spec charter (on the non-EIP/WG side), and ultimately leading to the development of multiple interoperable implementations if the goal is a standard, not just a specification. In the case of All Core Devs, this is also the time in which implementations will be tested on the test networks. In the case of wallets and, hopefully, ERCs, different measures of adoption “viability” should be satisfied, such as turning sample dapps or implementation artefacts into test suites or conformance regimes, or prototyping actual builds. These implementation activities aren’t strictly necessary to a good specification and should probably never be hard requirements of the EIP process itself; yet if the goal is an adopted standard and changing behavior in the market, standards-oriented projects should probably coordinate them at this stage and benefit from the feedback between specification and prototyping. Debatably, some working groups might even want to link the two processes explicitly or track implementation feedback in the same threads and repos where the specification work is happening, to ensure and maximize that feedback.
It’s been suggested that one way to improve draft specifications is for prototype and user-research discussions to take place as close to the specification as possible, e.g. in GitHub issues, links to other repos, and discussions. This might be too prescriptive if applied across a whole working group, though, as some communities or teams may prefer their discussions to center on recurring calls or informal communication channels such as Discord. It’s up to the group developing the spec to define the ways in which they want to communicate, but it helps to keep the focus on feedback and coordination as activity fans out across private implementations, prototypes, user research, etc.
The “review” phase can and should be more than a polite last call for objections–it should be where the circle of adoption grows and valuable feedback comes in from a different class of implementer.
As currently structured, the “Review” phase (and the minimum-two-week “Last Call” phase that ends it) represents a completed, finalizable document receiving its last feedback before going final, at which point the EIP process ends (and adoption happens, or doesn’t, completely outside its purview). There is a sort of unspoken rule that substantial changes “reset the clock” and extend the time for review, particularly in “Last Call”. It is, at present, a very finality-focused process centered on the specification, and there are often implied or interpreted stakes of importance or legitimacy mapped onto that status.
We propose rethinking this “home stretch” more holistically, as a chance to expand the circle of implementers beyond those who committed in advance (e.g. by collaborating on a charter or contributing), and to listen to implementers and evaluators further afield. Instead of thinking of this stage as “cement pouring”, after which no interventions are possible, we could think of it as the transition from specification-driven to adoption-driven, opening the latter door at the same time that the former closes. Particularly in cases where adoption and consensus have been supported all along by supplemental artefacts and public events, the “last call” period can be a springboard for more adoption, an occasion to reach out to participants who dropped off, and a time to start thinking long-term about the afterlife of the specification.
These cases, where adoption and standardization are the goal, might be difficult to coordinate without, e.g., the non-specification deliverables slowing down the usual specification timelines. We believe this stage will vary widely, and we should resist the temptation to complicate or over-formalize the expectations of this stage, even in specific working groups, as a greater degree of formality and adoption-oriented supplemental work could multiply the stakes and the potential for serious conflicts. As mentioned above, more centralized standards organizations handle objections that arrive after “feature freeze” differently, adjudicating them with some kind of organization-wide review board (often architectural or technical in scope). This approach would be harder to replicate in some categories than others, but working groups could certainly adopt some version of it (whether mandatory or recommended) as they tailor their processes.
There is a temptation to formalize the relationship between the specification-finality side of stage 4 and the “running code” or product-launch or testing side of stage 4, to protect early implementers from major breakage with minutes left on the clock, but this crosses the streams of specification and adoption a bit. We do not feel that a one-size-fits-all solution makes sense here, even working-group-wide, although perhaps it would make sense to treat this less as a gating condition or hard requirement than as an optional sidequest, such as an orthogonal (but public) badge of approval or working group “certification”.
Sometimes the best implementation reports come months after adoption picks up, after the painful hack post-mortems, for example. How do we create upgrade paths and feedback loops informing future documents and standards?
Currently, the EIP process is optimized for “immutability”– much of the editorial review gating “Review” status is focused on maximizing the longevity and maintainability of documents, on the assumption they will never be changed again. There is no errata process currently, nor a mechanism for searching the current EIP corpus for later EIPs that update or extend an earlier EIP, much less dynamic “forward links” on an earlier EIP to later EIPs. The only way to update an EIP is to publish a new informational one, and then make sure the right people see it somehow.
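To make “forward links” concrete, here is a minimal sketch, assuming a local checkout of the ethereum/EIPs repository (the path and preamble layout are assumptions), that inverts each document’s `requires` preamble field so that an earlier EIP can list the later EIPs that declare a dependency on it:

```python
# Sketch: build "forward links" by inverting the `requires` field in EIP preambles.
# Assumes a local checkout of github.com/ethereum/EIPs at ./EIPs (illustrative path).
import re
from collections import defaultdict
from pathlib import Path

EIPS_DIR = Path("EIPs/EIPS")  # assumed repo layout
forward_links = defaultdict(list)

for path in sorted(EIPS_DIR.glob("eip-*.md")):
    text = path.read_text(encoding="utf-8")
    eip = re.search(r"^eip:\s*(\d+)", text, re.MULTILINE)
    requires = re.search(r"^requires:\s*(.+)$", text, re.MULTILINE)
    if eip and requires:
        for dep in re.findall(r"\d+", requires.group(1)):
            forward_links[int(dep)].append(int(eip.group(1)))

print(forward_links[20])  # later EIPs that build on ERC-20, per their own preambles
```

Even something this crude only captures declared dependencies, not errata or security guidance, which is exactly the gap described above.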
While any or even all of the above-mentioned mechanisms would be welcome, they are hard to implement permissionlessly without some kind of counterweight to self-attested claims of relevance. This is partly because stage 5 isn’t really about documents or specifications, but about interpretations, implementations, and ongoing conversations outside of the documents. While more formal standards bodies use testing bounties and standing topical working groups to gate new specifications and monitor adoption of finalized ones in a closed and expert-filled feedback loop, that kind of centralization might be unpopular in our space. What is the more transparent, do-ocratic equivalent that relies less on authority and more on sweat equity?
In communities such as AllCoreDevs, where great importance is placed on specification and testing, implementers and specification writers past and future are already in dialogue, so this fifth stage happens automatically, or “falls out” of other communications channels and discussions. In the smart contract and wallet spaces, however, business incentives do not align with and support this kind of feedback, and figuring out what mechanisms will support it cheaply, without being onerous on all involved, is something of an open question.
Our intuition is that working groups that share a similar function to AllCoreDevs, requiring of their proposal authors a certain degree of prototyping, adoption-strategizing, and prior-art research before taking on a work item, are a low-risk place to start. Top-down solutions beyond that seem premature, but coordinating and finding a common language across categories for what these working groups do would allow them all to experiment and compare results before trying to generalize common solutions. Kicking off the Wallet Working Group seems like a great start, now that the wallet testing framework is achieving maturity and we can run experiments like asking authors to link to prototypes that implement proposed features while fully passing the WTF (a kind of lightweight “reference framework” requirement that aids adoption and evaluation).
To put it another way, stages 1 and 5 bleed into one another and are best done by the same people or in the same places, but both are currently handled in an ad hoc way at best, by volunteers or passionate individuals, sometimes on long, meandering ethereum-magicians threads and sometimes not at all. Insofar as stages 1 and 5 are a recognizable part of the standards process in more formal, less permissionless environments like technical SDOs, the blockchain community’s treatment of them could well be the most underfunded, undersupported, and underdeveloped part of how we “do” standards, and the lowest-hanging fruit in doing them better. We firmly believe that bridging stages 5 and 1 is best done by standing working groups, and we stand ready and motivated to help this crucial function be fulfilled across the space.
We have tried sketching out how to start thinking about a broader standardization process beyond the publishing of documents, and we hope it reframes the familiar questions and expectations around EIPs. We have tried to stop short of prescriptions or excessive formality, while still signaling where centralized standardization has landed in its solutions to analogous problems. We hope that this document can percolate and influence people over time, on a long-term vision level rather than on a short- or medium-term tactical level.
Tactically, however, it is 2024 and there is a distinctly wintery chill in the air, prices notwithstanding. Most employed heads are down and furiously building, making this a great time to shift incentives and encourage more transparent alignment and coordination on the dapp and wallet level. This coordination takes many forms, which could add up to a lot even if each seems small on its own:
The authors would like to thank, in no particular order, @chaals, @SamWilsn, and Annett Rolikova for reviewing this before it was published, and you for reading this far.
See A.L. Russell’s 2006 article in IEEE Annals of the History of Computing for a detailed history of the phrase. ↩
Since 2020, my (bumblefudge’s) career north star has been sustainably making myself helpful to the advancement of human agency in the digital realm, an umbrella encompassing disparate work in the “self-sovereign” identity-tech world, the “on-chain money” world, the “interplanetary data” world, and the “social web”. In fact, in 2020 this work didn’t even feel all that disparate, and only when checking back in with my feelings last week did I realize there was more under the umbrella than I realized at the time. Thematic focus aside, the euphemism “sustainably” is doing a lot of work in that first sentence; I try never to work “for free,” even more so now that I am a father, but naturally higher-paying work subsidizes lower-paying work when I’m lucky enough to get both.
Doing so while trying to “own” any part of my work (to seek rents in the present or future from that ownership claim) never sat well with me, so I have not taken equity, intellectual property, or even cryptographic (“on-chain”) tokens as payment for my time in any of this work. Perhaps there is a transparent, ethical, no-risk way of doing so, but I have been too busy playing connect the dots and learning all I can to bother with those details; “maybe later,” I say every time it comes up.
For as much as I never wanted to “own” a rent-seeking apparatus, or for that matter a “company,” it quickly became clear that in much of today’s world, and particularly in the software world (as befits an industry centered on seeking rents from intellectual property), companies get better property rights than individuals, and companies (particularly foreign companies) prefer to deal with other companies anywhere they can, as do governments. The conventional path of working across organization borders by “freelancing” doesn’t really work as well across national boundaries, however. To de-risk and simplify this situation for my clients, collaborators and the funders of agile prototyping and R&D efforts, some of them public, Balázs and I incorporated learningProof UG in 2020 precisely to “abstract out” the legal requirements and liabilities required of employment, contracting, and consulting, letting me (bumblefudge) and, as needed, my collaborators and colleagues on various projects focus on the work at hand, tracking hours and minimizing administrivia. To date, it’s mostly been a “pass-through” (or a “shell corporation” as I like to joke), simplifying how I bill my hours when I’m working for people outside Germany, which most of my clients have been. It has worked great for spinning up (and down) lightweight collaborations across and beyond the EU to do the kind of specialist tinkering that is required when you are proposing novel infrastructures for future internets.
But last year was different. The utopian vision of a more just “Social Web” seemed to hit everyone all at once in 2023 like some kind of ideological pandemic, as the critical communications infrastructure we euphemistically call “social media” underwent one of those “multicrises” that are so in vogue these days. Most of the crises reported as distinct were, to the cynical eye, simply two sides of one coin. The governance crises triggered by omnipotent billionaires indulging in seemingly personal brinkmanship were just the tabloid sideshow distracting us all from the real macro-narrative: with the sudden end of the US Fed’s zero-interest experiment, the shareholders of virtually every major technology platform demanded rapid “enshittification” (forgive the technical jargon) of their formerly benign-seeming freemium platforms, seeing no other way to quickly ratchet up dividends to perform above interest rates. Putting critical infrastructure for democracy and global information flows into the hands of advertising monopolies governed by private equity and openly maximizing profits above all public obligations and functions was starting to look like a bad idea even to observers who generally like public infrastructure to be operated for private profits.
What could little learningProof do to tilt at these world-historical windmills? A privately-held, profit-maximizing endeavor (however small and agile) was neither the right shape nor the right speed for this side-quest; the vibes were off, dear reader. Billionaires and boards running information businesses like health insurance companies got us into this mess; going 180 degrees in the opposite direction seemed a good way out of it. We needed to think bigger, flatter: not just learningProof but enshittificationProof, profitLocal or even profitNeutral, sharing the work and its wages do-ocratically and humbly. No C-suite, no shareholders, no masters to whom profits could be expected or demanded to be exfiltrated.
Wait, who is we, you may be asking? Wasn’t this in the first-person singular until now? The we, for now, began as @codenamedmitri, bengo.is, and myself.
I have been collaborating sporadically and coordinating efforts with the former since 2019 on all things decentralized identity, since almost the minute I understood that phrase to refer to one path to human agency in the digital realm.
Unbeknownst to me, I somehow failed to meet the latter at an ActivityPub conference in Prague that the former invited me to when I was in town for the 2019 Rebooting the Web of Trust conference a few blocks away.
Over the last two years, I’ve been discussing ActivityPub more and more with both Bengo and Dmitri, collaborating where we can and aligning our disparate efforts around that headless protocol, formalized in 2018 and self-governed in the most wonderfully and chaotically decentralized way. Our collaboration started as a simple, almost daily groupchat about current events, filling one another in on all kinds of backstory and perspectives on what was unfolding every day on the various relevant hashtags, making sense of it together. It quickly grew to feel like a team working together on shared problems and toward a shared vision.
We decided that a conventional shareholder corporation wasn’t the right fit for working on public infrastructure verging ever closer to being “critical”; many of our role-models in public-good technology were B-corporations, non-profits, unincorporated collectives, and worker-owned co-operatives. The last of these felt the closest to our do-ocratic way of working and our transparency goals: a conventional LLC, except incapable of selling equity, keeping 100% of the governance of all assets, projects, and liabilities within the organization.
The three of us agreed from the beginning that this protocol, in comparison to other global networks of loosely or trustlessly federated “servers” (Matrix, Secure Scuttlebutt, blockchains, SoLiD, etc.), somehow grew to serve millions of users without a reference implementation or a conformance regime; a community of doggèd developers gritted their teeth through a lot of ad hoc coordination and experimentation to arrive at an open social graph that worked well enough for its millions of émigrés from the industrialized attention economy commonly referred to as “social media”. In 2023, the social media & personal messaging “sector” of the software industry from which this ad hoc federated social web had rebelled and run screaming was suddenly collapsing, and people were pouring out of the exits; the “Fediverse”, long artisanal and proudly devoid of an internal economy, might just need to blitzscale and adapt to a reality in which some commercial or semi-commercial form of Fediverse couldn’t help but develop.
At the same time as we were speaking to each other about how the Social Web might harness these market forces, regulatory groundswells, and cultural narratives, we were also speaking to developers across our various social and professional worlds. It was clear that ActivityPub, the protocol out of which today’s Fediverse grew as a tentative first instantiation, was really no more visible than in 2019, as a vision or as a reality. The Fediverse was experiencing growing pains (as millions of new users poured into web2 server architectures that had never really been optimized to scale efficiently), but also upgrading pains, as there was no medium-term shared vision on the horizon beyond the immediate technical debts and user needs that plague software this ambitious. In the most practical sense, ActivityPub had never become a “protocol” anyone knew or thought about distinct from the Fediverse, because it didn’t have a reference implementation or a conformance system anyone could use in decision-making about its capabilities, its strengths and weaknesses, and its role in the greater web; all we had was a few implementations, interoperating with one another patchily, and no one incentivized, much less funded in a sustainable and credibly neutral way, to get us to that point.
Left to play out naturally, these market conditions would likely push the already ad hoc Fediverse to move rapidly further from the ambitious (and admittedly difficult) path imagined by the open protocol’s early designers. Without a lot of work on the neutral core that powers the federation of diverse software and user experiences and products, the goals of the ActivityPub community whose conference Dmitri invited me to crash in 2019 would likely get lost in the shuffle. And without the typical foundation people have when trying to build on an existing protocol or platform (i.e. a reference implementation and a testing regime), who could blame selfless community-motivated developers for optimizing for local maxima and clearer, better-documented goals?
Fortuitously, we weren’t the only people thinking about credible neutrality and the kinds of investments needed to defend the digital commons from market forces in 2023. Our conversations about ActivityPub landed in the right place at the right time, in that they coincided with the entrance of a major new force in the funding landscape for open source. It was a big year for public-good technology in Germany: the German Fediverse was increasingly coming into its own, strategizing at the Chaos Communication Camp and many smaller fediverse- or data-sovereignty events about the culture clash implied by massive US-based companies entering the Fediverse.
At the same time, the Sovereign Tech Fund was emerging to channel energies from the policy and civil society communities trying to navigate these topics. Incubated by FOSS powerhouse SprinD GmbH, STF was establishing itself as a funder of exactly the kind of credibly neutral Free and Open-Source Software that favors protocols over platforms. The Sovereign Tech Fund’s mission, and in particular its pursuit of sustainable, long-term governance for open infrastructure, resonated with our project to keep ActivityPub (the underlying protocol) up to date with the evolving product landscape built on top of it. We worked with the exceedingly excellent @tarakiyee@mastodon.online to scope a service contract that would fall squarely within the mission of STF and also be timely to the evolving landscape of the Fediverse, focusing on industrial-strength CI testing to make protocol conformance more objective and protocol extension more time-efficient, user portability, data portability, and developer community integration. It was a match!
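For a sense of what that kind of automatable conformance testing can look like at its very smallest, here is a hedged sketch in Python: the URL is illustrative, and while the ActivityPub spec does require actor objects to carry inbox and outbox properties, a real CI suite would of course cover far more than this:

```python
# Sketch: fetch an ActivityPub actor document and assert a couple of the
# properties the spec requires of actors. Illustrative only.
import json
import urllib.request

def check_actor(url: str) -> list[str]:
    req = urllib.request.Request(url, headers={"Accept": "application/activity+json"})
    with urllib.request.urlopen(req) as resp:
        actor = json.load(resp)
    problems = []
    if "https://www.w3.org/ns/activitystreams" not in str(actor.get("@context", "")):
        problems.append("missing ActivityStreams @context")
    for required in ("inbox", "outbox"):  # MUSTs for actors per the ActivityPub spec
        if required not in actor:
            problems.append(f"missing required property: {required}")
    return problems

print(check_actor("https://mastodon.social/users/Mastodon") or "looks conformant")
```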
One problem with a cooperative is that you can’t “raise” “money” (or, less euphemistically, you can’t borrow cash or pre-sell governance rights against the future organization); you can only keep a portion of the money passing through the organization on its way to the workers doing the work, and use that to amass operating capital for collective outlays. The coöp had, in the Sovereign Tech Fund, its first paying contract to bootstrap itself into existence, and the Sovereign Tech Fund was happy to be investing (non-dilutively) in a sustainable enterprise committed to keeping open infrastructure open and usable. Most contracts, however, are signed between pre-existing organizations, so learningProof is, on paper, fulfilling this contract, as a pass-through, fiscal sponsor, and liability host.
The contract we signed was broken up into a few workstreams (testing, new features, and community support/integration), which structure the contract into three phases punctuated by three milestones.
We’ll be posting grant reports here on the coop project’s own website as each of the milestones of the project is reached and its outputs open-sourced. In the meantime, stay tuned to the #ActivityPub hashtag on the Fediverse for developer chatter and the mailing list of the W3C Community Group for standards topics.
2023 was wild: my first child was born, I took the longest sabbatical I’ve taken since my professor days, and I made some drastic moves professionally and geographically:

- multiformats could go from “some repo on github” to an IETF working group (god, and IETF’s membership, willing!), which only has a chance at working if I concomitantly get my hands dirty throughout the greater protocol labs interplanetary archipelago, working on conformance this and specifications that as that archipelago nucleates out into something even more anarchic. For as corny as it sounds, it has truly been a joy to get to know both the immediate collaborators in my core team and also the whole ipfs-loving community of collaborators, contributors, and even loving defectors and re-implementers, from whom I have learned so much in 2023 that I should probably write another, longer piece just to fail at doing justice to their intellectual generosity. Lucky is the researcher who gets paid on the job, even when the job is not this novel and bizarre and ambitious and world-building.
- did:web, to support new architectures and projects across the education space but also the decentralization and alternative publication spaces.

On a more banal, personal note:
Going forward, 2024 might just be the year where learningProof’s blogs are written in the first-person plural, since all my goals for this year are really goals for various “us”es.
oh and we need a new internet, this should be the year we actually make one.
lfg, –bumble
Over a year ago, someone asked me for a personal essay on why I care so much about decentralized identity. What I ended up with after a weekend of freewriting was more manifesto and jeremiad than explanation (technological or emotional) but I went with it and published on Medium. I’m including it here as an overview of the context in which LearningProof does its work. The original tagline was, “An optimistic introduction to what could come after the total sovereignty of the VC-funded cycle of disruptions and consolidations” and in this context I would only add that at LearningProof, we do only (and anything) we are confident moves our world closer to that “after.”
Various new kinds of software, we are endlessly being told, are the Next Big Thing, just about to disrupt like a volcano of New e-Things that will certainly upend all of the things in the next X years. After you’ve read enough in this genre, it turns into a guessing game: will they put X at 5 years, or 9, or 7? You can usually tell by the adjectives in the first three sentences, as the rhetoric is at best overblown Ciceronian pomp and at worst Ted Talk runoff. I like to fill out a bingo card seeing how many rhetorical and ideological crimes I can identify in a given breathless Medium “article” promoting a new startup, written by someone literally overleveraged in the success of its product offering. Here’s a partial list, in case you want to make up bingo cards of your own:
Venture capital is the real audience of these missives, and, as Mike Judge’s blunt, Aspie CEO seminally quips on the HBO comedy, Silicon Valley, “The stock is the product.” The winner of this debate tournament is whoever promises the most disruption, since that is what the gamblers came to bet on. As we say in Spanish, “A río revuelto, ganancia de pescadores”—when the waters are choppy, [only] the fisherman comes out ahead. (And no, there are not fisherwomen in this analogy.)
These mammoth disruptions very rarely correspond to giant technical leaps, however; most of them are the results of the tiniest of innovations in user experience design, marketing, or convenience engineering. From a computer science point of view, these disruptive apps are apex predators on many levels. They centralize or repackage the data traces left by human experience in a tidy, privatized bureaucracy of monetizable information, but to do so, they stand on the shoulders of data processing giants, mammoth infrastructural investments, and decades-long collective refinements funded by public-private partnerships and backroom deals with national-security agencies. In just a few short decades, to the tune of neoliberalism’s mantra (“but who will pay for it, surely not me, or us?”), all of this mammoth infrastructural apparatus was rapidly and irrevocably privatized in both legal substance and public perception. The casino of speculative finance not only wrested away from government any control or even regulatory power over the internet “industry,” but in the process it has also convinced the public that many new, dangerous economic practices and social structures are permanent, natural, and inherent to “the internet age”.
Increasingly, access to monetizable data is distributed even less fairly than access to capital or to credit, while we keep being told ad nauseam that this is just how the internet works, end of story. As public distrust and even anger grows towards the ever-consolidating data traps holding our baby photos and high school acquaintances hostage, we are told that there is no alternative to trading privacy and anonymity for access to a public sphere hosted on private servers and monetized for shareholders. The internet has been defined as a giant machine trading us convenience for our data, which it must necessarily convert into fodder for “machine learning” (aka job-eviscerating automation) and dividends for shareholders (offshored and undertaxed, if taxed at all). We are told that our private thoughts and personal details are the lifeblood of the whole system, that without our data there is no there there, that no business or information technology or convenience is imaginable without it.
I am here to tell you, dear reader, that the next big thing in software is the opposite of all that naturalized and coercive data profiteering. The Next Big Thing is granular and total control over all the data pertaining to your “account” anywhere you open one, with no other data held there, and none of that data shared with anyone else. The next internet will not be a giant pyramid scheme devised to pry your data from you and then sell it to third parties. The Next Big Thing is not privacy or anonymity, although it includes a systematic right to exercise both a lot more easily and often. The Next Big Thing is granular data sovereignty and more sophisticated interactions of the public and private spheres. The Next Big Thing is Self-Sovereign Identity, as it is called among its devoted nerds. You are the Next Big Thing.
To the non-technical, but politically savvy reader, “sovereignty” might seem a hyperbolic term to use for better data controls, but there is a crucial distinction to be made between having the “right” to have your data “forgotten” by request (a right enforced by government regulation and by after-the-fact fines) on the one hand, and on the other, having the power to easily and conveniently delete your own data anywhere it’s stored (a power guaranteed by the design of, and exerted directly through, the data structures themselves). The latter power is sovereignty; the former right is a 19th century form of liberalism held limply in check by shaky, contingent regulation, the kinks in which are still very much being worked out. And supply-side regulation at that, not the most historically effective or popular at time of press.
But speaking of unpopular supply-side regulations, the European Union has recently started enforcing an ambitious set of supply-side data protections, limits, and regulations, a mammoth piece of continental legislation commonly referred to as GDPR (“General Data Protection Regulation”). It was the result of millions of hours of research and multiparty negotiations, a finely-engineered Great Wall of a law, which basically set out to update aging legal concepts around privacy and consumer protections, disincentivizing coercive and secretive business practices rather than banning them outright.
No one in the industry missed this epochal challenge, particularly no one reading quarterly earnings reports. The data industry was, and remains, shook. Every internet business entangled in the ecosystem of data capture and data processing is currently scrambling to shut down a whole continent of its data harvesting operations, rewriting their already-baroque, hundred-page terms of service contracts and privacy policy statements to reflect newly-obligatory protections of the end-user’s rights. Other than a global tidal wave of emails notifying these end-users about updates to those contracts, a less legally-savvy internet dweller could totally miss the importance of this legal paradigm shift. But lawyers and accountants at massive conglomerates like Google have put serious resources behind simultaneously fighting against and planning for this sea-change for years, even if it could seem they’re still playing catchup today. In fact, most of the key players in the data industry will need many more years of retooling to come into full compliance, and in the meantime they have no choice but to budget in massive non-compliance fines levied by the day. Middle-sized data concerns have, so far, been the hardest hit (relative to their operating budgets), but the future of these legal battles is hard to predict, given the amounts of money and jobs at play, much less the politics. At least insofar as the same was said of the Big Banks after the speculation crisis of 2008, perhaps the Big Data concerns are too big to fail, or at least, so big that their failing in the near future would be a hassle to the interests that the EU represents.
But focusing too much on how GDPR affects the big players in today’s data economy is giving far too little credit to the EU’s long-term vision of competition and protectionism. I, an optimist, see the real long game of GDPR to be one of distracting and slowing Big Data, buying some time for the little guys to keep trying radically new things. In my opinion, an understudied positive outcome of GDPR is that it stacks the investment and licensing cards in favor of any business minimizing its reliance on re-appropriated personal data and psychological manipulation. GDPR is a godsend for everyone trying to build slower tech, more ethical tech, tactical tech that Silicon Valley would not only never invest in, but has historically tried to strangle in the crib to protect its deep investments in the most nefarious forms of surveillance capitalism. (Full disclosure: I live in Berlin and I contribute in small ways to various non-profit and commercial projects, in addition to doing paid research and consulting for an SSI-specialized research firm called The Purple Tornado.)
One thing that GDPR seems specifically geared towards encouraging is new kinds of networking and services built on a foundation of Self-Sovereign Identity, which many glibly (and perhaps inaccurately) summarize as “GDPR-compliant by design” at a time when GDPR compliance seems a distant fever-dream to every large internet company operating in Europe. Much like the nerdier, more public-sector-oriented corners of the broader blockchain ecosystem, or the peer-to-peer networking scene, self-sovereign identity is currently more of a community than an industry, with engineers and ideologues heavily represented at conferences without much of the smell of money in the air. There are a few standards organizations working out the interoperability of future large-scale platforms and infrastructure projects, and these have, to date, been relatively good about keeping a place at the table for smaller, non-profit players and government initiatives. There are conferences and whitepapers, there are roundtables at Davos and reports by industry futurists and public-good technologists. There are small, closely-watched trials being run of government systems built on a foundation of SSI in Northern Europe, Switzerland, and Canada.
For obvious reasons, there aren’t huge pots of speculative capital rushing this or that specific initiative to market; for the most part, SSI is starting small and open-source, mostly funded by government grants and long-view R&D investments from industry giants. Of the few companies incorporated and running a payroll already, a surprisingly high portion are B-corps, “social enterprises,” and true, independent non-profits. SSI tech is not coming soon to the walled garden of a cellphone app store near you. But if you follow the weirder, more cutting-edge trade papers and tech conferences, you might hear whispers blowing through the trees.
In many ways, SSI is not a bold new vision of the internet, but a return to the foundations of the internet, when the costs of building new infrastructure were still openly shared and the Sharing Economy hadn’t been patented and monetized yet. (To those that would challenge such a linear intellectual history, I would perhaps caveat this reference to the “foundational” vision of the internet as perhaps a tad more indebted to Ted Nelson than to Tim Berners-Lee.) If anything, GDPR crystallized and acted on decades of mounting unease with which the global middle class watched an increasingly homogenous and overtly neoliberal software industry bully all other industries and governments worldwide. GDPR is hardly a perfect law and I would hate to come across as a cheerleader for it, but it has proven a decisive move to shift the ground under the most powerful industry on earth, disincentivizing business models that it implicitly designated as toxic and protecting a space in which to experiment with new ones. At its best, that space will foment scrappy, righteous alternatives (and yes, more startups, for better or worse) that no one will tell you about on pay-to-play information platforms like Facebook or Google News. Or, to put it another way, that no one has borrowed enough venture capital to pay Facebook and Instagram to spread the word about, at their current rates.
Back to brass tacks: what does an internet built on the basis of self-sovereign identity actually look like? How can you retain “sovereign” control over your data if it’s still being stored on a company’s servers and processed to do cool stuff with it?
First off, there is far less of your data going to each company and the companies aren’t sharing it between them, so there is a lot less data at play in any given transaction. Indeed, limiting each transaction or account’s data access to the “minimum data necessary” is a kind of mantra to the most hard-line SSI thinkers. And to be clear, these hard-liners have a pretty solid point about “SSO” (single sign-on), the user-friendly password workaround that gives the google/facebook duopoly access to the lion’s share of personal data on earth. Every account you access “through” a facebook or google account gets lumped in with all the data facebook or google already got from you directly, making the “files” that data brokers legally gather on each of us massive archives by comparison to what the East German secret police kept in filing cabinets on its most suspicious and free-thinking citizens.
Why is it so easy for shady data brokerage companies to collate all your data from diverse sources and sell those composites on the open market legally? Why are there copies of your address and social security number on every server and cloud on earth, if all you really need to prove is age here, address there, a valid driver’s license or car insurance in a few cases? How did we get to a place in history where it is so terrifyingly easy to track the location of anyone by their mobile phone number? Not by using the minimum data necessary, for starters, and not without letting the profits outweigh the costs of misusing data, lying to the people that generate it, and selling it to the highest bidder.
Secondly, your “sovereignty” over the valuable and scarce data you do choose to give out hinges on complex systems of cryptography: the shift in power on the social level is effected by a shift in data structures whereby the access you grant others to your data is enforced by the lending, granting, and changing of “keys” and “locks”. (For the highly non-technical, just remember the worst breakup of your life, whether personal or professional, and that moment when you think to yourself: “What if I change the locks while they’re out?”).
In a cryptographically-enforced data sovereignty model, you prove your identity each time you sign in by presenting a one-time cryptographic “key” that the site then uses to encrypt your data anywhere it is stored or transmitted, aka, anywhere it could be hacked or breached. Your data is meaningless without temporary and revocable access to a key that lives in your private wallet. In the case of most current SSI pilots and betas, that key lives in a MetaMask wallet or something similar– a browser add-on or app that securely stores your key to various accounts. Alternative storage methods exist as well, for more complicated security situations. To put it another way, every time you log off, you pull your key out of the lock and all the core, protected data of your account gets instantly encrypted back into gobbledygook; until you log back in, no one, not even the site itself, should be able to read, much less change or sell, any of it.
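To make the “key” metaphor concrete, here is a minimal sketch of that kind of sign-in, using Python’s cryptography package; the names (wallet_key, challenge) are illustrative, not drawn from any particular SSI specification or wallet’s actual API:

```python
# Minimal sketch of key-based sign-in: the service never holds a password,
# only the public half of a keypair that lives in the user's wallet.
import os
from cryptography.exceptions import InvalidSignature
from cryptography.hazmat.primitives.asymmetric.ed25519 import Ed25519PrivateKey

wallet_key = Ed25519PrivateKey.generate()    # stays in the user's wallet
registered_pubkey = wallet_key.public_key()  # the only thing the site stores

# Sign-in: the site issues a fresh random challenge, the wallet signs it,
# and the site verifies the signature against the registered public key.
challenge = os.urandom(32)
signature = wallet_key.sign(challenge)
try:
    registered_pubkey.verify(signature, challenge)
    print("signed in")
except InvalidSignature:
    print("access denied")
```

Real SSI stacks layer DIDs, verifiable credentials, and selective disclosure on top of this primitive, but the shift in power dynamic is already visible here: the site can check who you are without ever holding a secret it could leak or resell.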
The “attack surfaces” for malfeasance or unauthorized access to your data are not reduced to zero, but they are smaller, and enforcing fair play on the remaining ones might only be possible if the core code is, if not completely open, at least periodically reviewed by neutral third parties, be they governments, competitors, or white-hats. From the end-user’s point of view, all this translates to a simpler social contract: you have at no point traded or “sold” your data to a company that doesn’t have powerful access to it even while it’s processing it. You have rented your data out to one party, and you can effectively erase it yourself without logging in to that party’s platform, simply by revoking the access credentials you issued to the site.
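And a correspondingly minimal sketch of “changing the locks”: if the service only ever stores ciphertext, then destroying or rotating the key grant amounts to erasure (again using the cryptography package; the scenario and names are illustrative):

```python
# Sketch: the service stores only ciphertext; the wallet controls the key grant.
from cryptography.fernet import Fernet, InvalidToken

grant = Fernet.generate_key()                    # key material held by the wallet
stored_ciphertext = Fernet(grant).encrypt(b"dob: 1984-01-01")

# While the grant is lent out, the service can work with the data:
print(Fernet(grant).decrypt(stored_ciphertext))  # b'dob: 1984-01-01'

# "Revocation": the wallet destroys (or simply stops presenting) the grant,
# leaving the service holding bytes it can no longer read.
revoked = Fernet(Fernet.generate_key())          # any other key fails
try:
    revoked.decrypt(stored_ciphertext)
except InvalidToken:
    print("ciphertext is now gobbledygook to the service")
```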
This might seem like an overzealously individualistic foundation for a new digital economy, but many see it as more akin to double-entry bookkeeping, forcing the designers and builders of data systems to factor user privacy and more thoughtful data design into their core architecture rather than dismissing data breaches and complex user needs as “externalities” or “corner-cases” for the lawyers to worry about later. As anyone who has worked in information security can tell you, the vast majority of data systems were not (and are still not) designed with security or privacy as core functions; instead, they are built insecurely, shown to investors, and then at the last minute, just before rushing them to market, they have flimsy security and privacy controls “bolted on” after the fact.
This organizational dysfunction within the industry might have been less consequential in an earlier moment in the history of capitalism, but in this one, there is a worldwide shortage of qualified information security professionals keeping those controls bolted on, and a rapidly growing black market for, shall we say, informal information security professionals trying to loosen those bolts day in and day out. “Web 2.0”, as some people call our current dominant model of social media and other platforms powered by personal data and surveillance, has evolved from a global village to a hacker’s paradise, a data broker’s fire-sale, and a private, cautious person’s hellscape in less than two decades.
Scrapping it all and starting over is starting to sound like a good idea to whole swathes of the world’s population. If we get lucky, Web 3.0 will be a lot less vertically-consolidated, powered by collectively-governed systems and peer-to-peer networks, with far fewer robber barons and stock-rich oligopolies moving fast and breaking shit. Maybe the coming internet will be a much smaller, quieter, slower, and more humble internet that, plainly put, just does less and disrupts nothing at all. I can almost guarantee it will be drastically less convenient, at least for the first generation or two. But if that’s the price of getting the teeth of the data vampires out of our necks, it might be worth a shot. Maybe we’re drunk on seemingly boundless convenience, and it would do us some good to build something a little more slowly and empathetically for a change.