<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom"><generator uri="https://jekyllrb.com/" version="3.9.0">Jekyll</generator><link href="https://sambleckley.com/feed/writing.xml" rel="self" type="application/atom+xml"/><link href="https://sambleckley.com/" rel="alternate" type="text/html"/><updated>2026-05-13T14:03:38-04:00</updated><id>https://sambleckley.com/feed/writing.xml</id><title type="html">Sam Bleckley | Writing</title><subtitle>Sam is a software engineer, designer, and artist. This is Sam's blog.</subtitle><entry><title type="html">The Consolation of Discernment</title><link href="https://sambleckley.com/writing/discernment.html" rel="alternate" type="text/html" title="The Consolation of Discernment"/><published>2025-02-21T00:00:00-05:00</published><updated>2025-02-21T00:00:00-05:00</updated><id>https://sambleckley.com/writing/discernment</id><content type="html" xml:base="https://sambleckley.com/writing/discernment.html">&lt;p&gt;Today I’m thinking about discernment.&lt;/p&gt; &lt;p&gt;In 1521 Íñigo López de Oñaz y Loyola, successful soldier and man-about-town, is finally injured badly in battle: his leg is fractured terribly by a cannonball. Sitting in bed, he daydreams about his future life. All the daydreams are fun and engaging — why daydream about boredom and unhappiness? — but sometimes his fantasy ends and he’s left engaged in the rest of the day enlivened and full of energy, and other times he’s left restless and hollow. After some experimentation and interior observation, he calls the former feeling &lt;em&gt;consolation&lt;/em&gt; and the latter &lt;em&gt;desolation&lt;/em&gt;; and finds they apply to all sorts of thoughts and actions, memories and plans. He gives the name &lt;em&gt;discernment&lt;/em&gt; to the process of paying attention to which things result in consolation and which result in desolation.&lt;/p&gt; &lt;p&gt;Íñigo goes on to have holy visions and attribute consolation to the spirit of god and desolation to spiritual evil. He redirects his life and is eventually remembered as St Ignatius of Loyola, founder of the Jesuit order.&lt;/p&gt; &lt;p&gt;Those additional steps are culturally contextual; feel free to take them if they appeal to you, but they’re not necessary in order to make that initial observation: some things that are nice while they occur leave me feeling empty and hollow afterward, and some things leave me energized and full of delight even after they’re done. Discernment is a functional mental tool outside any religious context.&lt;/p&gt; &lt;p&gt;When my life is going well, and my daily actions are mostly part of larger plans, I don’t feel much need for discernment. I’m taking step 17 of 25 on my way to a goal, and perfectly satisfied. It is when life is somewhat rougher, when there is no plan, that discernment comes into its own. At moments when &lt;em&gt;most&lt;/em&gt; actions feel like they will be &lt;em&gt;mostly&lt;/em&gt; futile — when it’s unclear if there is any way to achieve worthwhile goals — discernment still functions very effectively at picking one course of action over another. Discernment ignores the ends and judges the means based on what they do &lt;em&gt;internally&lt;/em&gt; rather than externally. Even if you’re Bill Murray in Groundhog Day and all your efforts will be washed away like sandcastles, consolation and desolation can provide some kind of compass.&lt;/p&gt; &lt;p&gt;(There is some relation to &lt;a href=&quot;https://sambleckley.com/writing/local-and-global-motivation.html&quot;&gt;global motivations&lt;/a&gt; here, though discernment is nearly useless for storytelling purposes or team morale.)&lt;/p&gt;</content><author><name></name></author><category term="thinking"/><summary type="html">Today I’m thinking about discernment.</summary></entry><entry><title type="html">The Inefficient Courtroom</title><link href="https://sambleckley.com/writing/the-inefficient-courtroom.html" rel="alternate" type="text/html" title="The Inefficient Courtroom"/><published>2025-02-19T00:00:00-05:00</published><updated>2025-02-19T00:00:00-05:00</updated><id>https://sambleckley.com/writing/the-inefficient-courtroom</id><content type="html" xml:base="https://sambleckley.com/writing/the-inefficient-courtroom.html">&lt;p&gt;Today I’m thinking about trial by jury and efficiency.&lt;/p&gt; &lt;p&gt;In a courtroom, everyone has a specific role. Each lawyer is supposed to make the most effective and compelling argument they can, regardless of who they imagine to be guilty. The judge is supposed to constrain those arguments to the law, regardless of who they imagine to be guilty. Outside the courtroom, there’s an entire representative democracy making that law, which must apply to everyone, regardless of who the lawmakers imagine to be guilty. The jury is supposed to assess those arguments and come to a conclusion about the arguments, not the people involved. The stenographer is recording the proceeding so it can be vetted, and the decision appealed if anyone hasn’t done their job.&lt;/p&gt; &lt;p&gt;In reality, there are plenty of situations where the judge alone makes a (&lt;em&gt;ahem&lt;/em&gt;) judgment call; but in the perfect vision of a jury trial, no one person makes the entire moral decision of right or wrong, guilty or not guilty, legal or illegal. Everyone has some smaller, less subjective job to do, and the final verdict is the sum of all those jobs without being  exactly equal to any of them.&lt;/p&gt; &lt;p&gt;In this way, the court is a &lt;a href=&quot;https://plato.stanford.edu/entries/chinese-room/&quot;&gt;Chinese room&lt;/a&gt;. It is a system for making those  decisions that is, in principle, more resilient than any single person, and indeed in spite of any single person. A person can be corrupt, or lazy, or biased; and even if it were humanly possible for a perfect judge to exist, undying, and able to preside over every dispute, our society is varied enough that there is no single ideal of moral judgment for that person to live up to.&lt;/p&gt; &lt;p&gt;A courtroom is, for that reason, much less efficient than a single person making decisions! A court is slow and plodding and expensive and resource-intensive. Robustness is redundancy, and redundancy is expensive.&lt;/p&gt; &lt;p&gt;I think this is a feature common among almost all long-lasting positive institutions; they are not efficient, they are robust. They work slowly and change slowly, and that slowness is, if not a feature in itself, a direct result of the most important feature.&lt;/p&gt; &lt;p&gt;&lt;a href=&quot;https://fs.blog/chestertons-fence/&quot;&gt;Chesterton’s Fence&lt;/a&gt; urges us to understand the purpose of what we’re replacing but I think it is easy to look at slow robust systems and say “I understand the input and output of this process, and I can do it much faster and cheaper!” The robustness of social systems is a secret, hidden feature because the corruption and conflict it counteracts is rarely visible — because it’s ineffective, it goes away on its own! But if you remove that robustness, the corruption will return the moment your back is turned, and recreating the slow, plodding, but robust system is far harder than tearing it down — especially when it was built over centuries of real-world learning.&lt;/p&gt;</content><author><name></name></author><category term="thinking"/><summary type="html">Today I’m thinking about trial by jury and efficiency.</summary></entry><entry><title type="html">One Song in Your Musical Is Wrong</title><link href="https://sambleckley.com/writing/one_song_is_wrong.html" rel="alternate" type="text/html" title="One Song in Your Musical Is Wrong"/><published>2025-01-31T00:00:00-05:00</published><updated>2025-01-31T00:00:00-05:00</updated><id>https://sambleckley.com/writing/one_song_is_wrong</id><content type="html" xml:base="https://sambleckley.com/writing/one_song_is_wrong.html">&lt;h2 id=&quot;i-beauty-and-the-beast-1991&quot;&gt;I. Beauty and the Beast (1991)&lt;/h2&gt; &lt;p&gt;Many musicals have an “I Wish” song — a song that establishes early on what the protagonist feels is missing from their life, a desire the story will both fulfill and complicate.&lt;/p&gt; &lt;p&gt;In Disney’s animated Beauty and the Beast, Belle’s “I Wish” song is very short and says, roughly: “I don’t want to be a little wife; I want adventure in the wider world; I want much more than is planned for me.” She releases dandelion seeds into the wind over a sun-drenched vista.&lt;/p&gt; &lt;p&gt;Belle’s story: she gets locked in the darl castle down the street, and eventually marries the guy she finds there.&lt;/p&gt; &lt;p&gt;What is most fascinating, to me, about that contrast is that there’s no need to change the story. The whole film would work fine if you swap out those 30 seconds of song, instead. She could wish for a quiet place to read, someone to talk to, and to not be mocked for her imagination — that would work with all the other evidence! And it would work in the “oh, getting my wish is not making me content” way, at first, too!&lt;/p&gt; &lt;p&gt;Instead, she sings an “I Wish” song that might reasonably suit the story of The Little Mermaid or Mulan — a story with travel and adventure in foreign lands.&lt;/p&gt; &lt;h2 id=&quot;ii-wicked&quot;&gt;II. Wicked&lt;/h2&gt; &lt;p&gt;Many musicals have a show-stopping number involving a character girding their loins to make an important choice. In Wicked, that number is Defying Gravity. But as much as Defying Gravity is a banger, it always felt &lt;em&gt;thematically&lt;/em&gt; unearned to me. What does gravity, or flying, have to do with the emotional journey of any of the characters? What does the metaphor built by the lyrics have to do with the internal decision Elphaba is making?&lt;/p&gt; &lt;p&gt;It’s the right moment in the musical for a song like that — the &lt;em&gt;emotion&lt;/em&gt; is earned — but the actual content of the song is more related to this tiny bit of plot machinery (a spell that makes things fly) instead of any central theme or aspect of character.&lt;/p&gt; &lt;p&gt;The concept of defying gravity absolutely &lt;em&gt;could&lt;/em&gt; have thematic resonance: she’s defying the laws of physics AND the laws of man. So what could you change leading up to it to make that connection work?&lt;/p&gt; &lt;p&gt;You could make two changes:&lt;/p&gt; &lt;ul&gt; &lt;li&gt;make Elphaba extremely rules-bound and Glinda “rules are for other people”; let that drive their conflict&lt;/li&gt; &lt;li&gt;and make flying a forbidden branch of magic&lt;/li&gt; &lt;/ul&gt; &lt;p&gt;In that adjusted story, Elphaba seeing the cruelty of the laws towards non-humans, deciding to be an outlaw, AND experimenting with flight all thematically act together, and then whether or not to “Defy Gravity” can effectively represent the choice both characters are making.&lt;/p&gt; &lt;p&gt;Admittedly, equating “defying the laws of physics” with “defying the rules” could cause other problems, given that the head of state is A Wizard.&lt;/p&gt; &lt;p&gt;So, alternately, rather than change the story to suit the song, how could the song be adapted to better connect to the story given the themes that are already there?&lt;/p&gt; &lt;p&gt;The first act is almost entirely about perception: being seen as good vs being good; being seen as strong, or beautiful, or liked, or beloved vs actually &lt;em&gt;being&lt;/em&gt; any of those things.&lt;/p&gt; &lt;p&gt;A physics metaphor about giving up on being perceived as something? That’s easy! Try one of these:&lt;/p&gt; &lt;ul&gt; &lt;li&gt;“I think I’ll try superposition”&lt;/li&gt; &lt;li&gt;“I think I’ll try quantum coherence”&lt;/li&gt; &lt;li&gt;“I think I’ll exist in several eigenstates”&lt;/li&gt; &lt;/ul&gt; &lt;p&gt;The scansion and prosody of these options are, frankly, unassailable, and the language widely accessible. So that’s how I’d fix Wicked’s hit song and my version would be way more relatable and popular than the dud they’ve got, which no one likes.&lt;/p&gt; &lt;h2 id=&quot;iii-sheesh-sam-what-are-you-talking-about&quot;&gt;III. Sheesh, Sam! What are you talking about?&lt;/h2&gt; &lt;p&gt;I think a lot of my acquaintances think of me as a critical and negative person because I tend to think and talk about things I think don’t work, even when they’re a small detail in an otherwise successful whole (one short song, or even one lyric, in a popular and critically acclaimed musical, to pick an example at random.)&lt;/p&gt; &lt;p&gt;That’s not because I want to tear others down, though: I do exactly the same to my own work, both the technical and the artistic! Nor is it because I am a perfectionist — I am happy to release “good enough” into the world and work from there. My nitpickiness comes from just this: to me, how things fail is the most interesting part. That is the meat worth chewing on.&lt;/p&gt; &lt;p&gt;There are &lt;em&gt;sometimes&lt;/em&gt; things to be learned in analyzing something that works well, to see how it works; but most often what you discover in the analysis is “it works because it follows guiding principles I already know.”&lt;/p&gt; &lt;p&gt;There is &lt;em&gt;always&lt;/em&gt; a practical lesson available in examining something that doesn’t work, and investigating how it could be improved; because even when you discover that it is most easily improved by making it adhere to the principles you already know, you get &lt;em&gt;the practice of mentally doing that work&lt;/em&gt;.&lt;/p&gt; &lt;p&gt;This urge to pick at the scabs of any undertaking has, in many ways, led to my current approach to consulting. I rarely accept a contract longer than 6 months. Why? Because while your teams and processes will grow and improve under my constant prodding at even small dysfunctions, they will not &lt;em&gt;stabilize&lt;/em&gt;. To make a beautiful bonsai, you train and prune it aggressively &lt;em&gt;and then leave it alone.&lt;/em&gt; I am constitutionally very good at the former and poor at the latter; so I try to provide the nitpicking as a service, rather than fight my instincts in order to effectively serve a long-term placement.&lt;/p&gt; &lt;p&gt;Still, it is good for me, sometimes, to play against type and spend some time admiring what works. If you’re my opposite and tend to shy away from analyzing smaller losses inside larger successes, let’s trade for a bit: I’ll set my nitpicks aside for a bit, and you try to see the failures as the most interesting part.&lt;/p&gt;</content><author><name></name></author><summary type="html">I. Beauty and the Beast (1991)</summary></entry><entry><title type="html">Why I Rewrote a Century-Old Artist’s Anatomy Book</title><link href="https://sambleckley.com/writing/why-i-rewrote-bridgman.html" rel="alternate" type="text/html" title="Why I Rewrote a Century-Old Artist’s Anatomy Book"/><published>2025-01-01T00:00:00-05:00</published><updated>2025-01-01T00:00:00-05:00</updated><id>https://sambleckley.com/writing/why-i-rewrote-bridgman</id><content type="html" xml:base="https://sambleckley.com/writing/why-i-rewrote-bridgman.html">&lt;p&gt;&lt;img src=&quot;/assets/images/bridgman-figure.png&quot; alt=&quot;Two figure drawings and an anatomy study of the arm by George Bridgman&quot; /&gt;&lt;/p&gt; &lt;p&gt;(If you’re not interested in the how or why of this project, and just want the result, it’s available &lt;a href=&quot;https://www.amazon.com/dp/B0DQRDGK9C&quot;&gt;here&lt;/a&gt;.)&lt;/p&gt; &lt;p&gt;Bridgman’s &lt;em&gt;Constructive Anatomy&lt;/em&gt; is a classic study text for artists that has been in print for over a century. It’s time-tested and artist-proven: comic book artists, illustrators, and fine artists all study Bridgman. Other anatomy books may be more complete, more technical, more clear, or more accurate — but Bridgman’s drawings look like art. Any attempt to improve or correct Bridgman would be foolish! An act of hubris!&lt;/p&gt; &lt;p&gt;Last year, I set out to improve and correct Bridgman.&lt;/p&gt; &lt;p&gt;In my defense, this project didn’t start with that goal. I only wanted to do a more in-depth study of Bridgman for myself. Bridgman writes in a way that’s both dated and pretty convoluted. So, to prove to myself that I was truly digging in and understanding it, I decided to rewrite the text, paragraph by paragraph, into modern words. English-to-English translation: how hard could it be?&lt;/p&gt; &lt;p&gt;(Unavoidably, complications are sure to follow. This is a story of scope creep, as much as it is about writing or drawing.)&lt;/p&gt; &lt;p&gt;Almost as soon as I began, I discovered that some of the texts challenges were more subtle than the individual sentences being difficult to parse.&lt;/p&gt; &lt;h2 id=&quot;the-cryptic&quot;&gt;The Cryptic&lt;/h2&gt; &lt;p&gt;Take, for example, the case of &lt;em&gt;wedging&lt;/em&gt;. Bridgman has this to say, on the very first page of his text:&lt;/p&gt; &lt;blockquote&gt; &lt;p&gt;Masses of about the same size or proportion are conceived not as masses, but as one mass; those of different proportions, in respect to their movement, are conceived as wedging into each other, or as morticed or interlocking.&lt;/p&gt; &lt;p&gt;The effective conception is that of wedging.&lt;/p&gt; &lt;/blockquote&gt; &lt;p&gt;Throughout the rest of the work, he will point out when forms should be thought of as &lt;em&gt;wedging&lt;/em&gt;. Bridgman does not, however, ever define the word; he never illustrates the difference between two masses wedging into each other and any other form of contact.&lt;/p&gt; &lt;p&gt;I studied every time Bridgman used the word, both in this text and in others. I turned to other artists’ attempts at decoding the word. In the end, I came up with an analysis that I believe is effective, and that’s plausibly what Bridgman meant by the word, but there’s simply not enough evidence for my multi-page description, with new illustrations, to feel like a translation.&lt;/p&gt; &lt;p&gt;I hadn’t even gotten through Bridgman’s half-page introduction and already I was forced to think about how to distinguish between his ideas (if rephrased in modern words) and my own additions and analysis. This was the first hint that I would not end this project with a plain text file of notes.&lt;/p&gt; &lt;h2 id=&quot;layout-laid-out&quot;&gt;Layout, Laid Out&lt;/h2&gt; &lt;p&gt;Some problems with &lt;em&gt;Constructive Anatomy&lt;/em&gt; are due to the constraints of affordable bookmaking in the 1920’s: none of the illustrations are in line with the text, and of course, they’re all in black and white. But other issues are deeper and more structural. Bridgman is miserly with his explanations: he rarely explicitly illustrates an idea if he’s described it, and doesn’t always describe an idea if he’s illustrated it. When the text does directly refer to an illustration, they are almost never found on the same page.&lt;/p&gt; &lt;p&gt;I couldn’t just rewrite the text and get anywhere. I had to connect the illustrations to the text, and (where necessary) make their labels easier to follow. I spent a week slicing out each of the 500+ illustrations and re-sorting them based on what they were attempting to show. This was another judgment call; the original text alternates pages of text with pages of dozens of uncaptioned drawings. It’s very likely many of those drawings were intended to demonstrate more than one idea. Moving them in line with the text makes it easier to connect them to one idea, but also easier to ignore any other lessons they might be trying to teach.&lt;/p&gt; &lt;p&gt;Due to the order he visits the body parts, Bridgman is forced to repeat himself to avoid using anatomical names he hasn’t yet defined. Once I was bought in enough to move all the illustrations, reordering the sections seemed obvious and necessary.&lt;/p&gt; &lt;p&gt;It was probably around this point that I started to consider the fact that George Bridgman died over 70 years ago, and his work is in the public domain, now: I could, legally, publish the result of all this work.&lt;/p&gt; &lt;h2 id=&quot;a-product-of-his-time&quot;&gt;A Product of His Time&lt;/h2&gt; &lt;p&gt;Other challenges are due to the social era: Bridgman’s anatomical vocabulary is sometimes outdated, and a few times completely incorrect (to be honest, I remain uncertain where he thought the brachioradialis or levator scapulae belonged; eventually I gave up trying to understand and put them in the correct places). His terminology is old-fashioned, and his anatomical analysis is, at least on one notable occasion, overtly racist.&lt;/p&gt; &lt;p&gt;Some important bits are completely missing. He ignores the female figure entirely – there are feminine faces and hands, but no feminine bodies appear anywhere in the entire book. In a course on anatomy for fine artists, he never mentions the breast. Some of his other books at least include a few female nudes, and I was able to pull in some of them to fill in gaps; but any discussion of how to make a body more stereotypically feminine or masculine in appearance, I had to write from scratch.&lt;/p&gt; &lt;h2 id=&quot;wait-is-this-product-design-again&quot;&gt;Wait, Is this product design? Again?&lt;/h2&gt; &lt;p&gt;As I worked, I began asking other artists about their experiences with Bridgman. I started approaching the project less like a student and more like a product designer. When someone mentioned that their eyes glazed over at the Latin names, I started adding silly mnemonics to help make them easier to hang on to. I adjusted my color palette to be colorblind-friendly and began thinking about how the colors functioned across multiple illustrations, rather than just within one. I thought long and hard about how far I could go while remaining true to the meaning and spirit of Bridgman’s original curriculum.&lt;/p&gt; &lt;p&gt;Bridgman fills an important role, not because he’s the most technically accurate or comprehensive resource — quite the opposite! — but because he makes explicit aesthetic choices about how he wants to draw the human body. Studying his work is only partially about gaining the knowledge of a med student, and much more about discovering and showing your own sense of beauty in the same set of parts. His text has many insights about how to think about (and draw) the human body, and I’ve tried to preserve and expose those insights. But there are lots of texts out there that will name the muscles and bones and visible landmarks. Bridgman is special because his drawings are beautiful, rather than just illustrative.&lt;/p&gt; &lt;p&gt;The more I honed in on the experience I wanted people to have when studying Bridgman, the easier it was to make difficult decisions about what to keep and what could be left in the past.&lt;/p&gt; &lt;h2 id=&quot;its-a-book&quot;&gt;It’s a book!&lt;/h2&gt; &lt;p&gt;I’m proud to say that the result of all this work is now available for purchase, in a luxurious studio reference edition, with full color where necessary and big, note-taking-friendly margins. You can pick up a copy here:&lt;/p&gt; &lt;p&gt;&lt;a href=&quot;https://www.amazon.com/dp/B0DQRDGK9C&quot; style=&quot;display: block; text-align: center&quot;&gt; &lt;img alt=&quot;Cover of Constructive Anatomy&quot; src=&quot;/assets/images/bridgman-cover.png&quot; width=&quot;250&quot; /&gt;&lt;br /&gt; &lt;em&gt;Bridgman’s Constructive Anatomy:&lt;br /&gt; A Modern Revision&lt;/em&gt; on Amazon &lt;/a&gt;&lt;/p&gt; &lt;h2 id=&quot;its-not-a-living&quot;&gt;It’s (not) a Living&lt;/h2&gt; &lt;p&gt;“Wow, Sam.” I hear you say. “Is rewriting and improving public-domain texts a way to earn a living? A passive income?”&lt;/p&gt; &lt;p&gt;No. This project &lt;em&gt;is&lt;/em&gt; turning out to be more successful than I expected; I’ve sold one or two copies a day since I launched it. There was no need for the complicated pre-launch strategies a normal book needs because people are already searching for (and buying) “Bridgman’s Constructive Anatomy.” Because the text is in the public domain, it’s not hard to siphon off some of that traffic. There are a handful of people doing so (less successfully) without having even bothered with improving the text.&lt;/p&gt; &lt;p&gt;But it’s a beer money project. If &lt;em&gt;no one&lt;/em&gt; bought the original &lt;em&gt;Constructive Anatomy&lt;/em&gt; and they &lt;em&gt;all&lt;/em&gt; bought mine instead — complete market saturation — I would still net less than $5000 a year. And even if I wanted to repeat the trick with a different book, to double up that payday, there just aren’t that many books in a similar position: public domain, relatively well-known, &lt;em&gt;and&lt;/em&gt; deeply flawed in ways I have the skills to recognize and correct.&lt;/p&gt; &lt;p&gt;That’s not a disappointment — I’ve known that this wasn’t a money-maker from the start — but I’ve seen a few people get an entreprenurial glint in their eye as I’ve talked about this project. Keep your feet on the ground.&lt;/p&gt; &lt;h2 id=&quot;appendix-runes&quot;&gt;Appendix: Runes&lt;/h2&gt; &lt;p&gt;Almost every illustration from the original book made it into my revision. A number of illustrations from Bridgman’s other texts also made it in and I created a few dozen new illustrations to fill in gaps. A handful of originals, however, didn’t make the cut. Here are a few Bridgman doodles that, even after many months of extensive study, I remain unable to interpret, at least in terms of what lesson they might convey to a student of art or anatomy.&lt;/p&gt; &lt;p&gt;&lt;img src=&quot;/assets/images/rune1.png&quot; alt=&quot;A rune&quot; style=&quot;max-width: 250px&quot; /&gt; &lt;img src=&quot;/assets/images/rune4.png&quot; alt=&quot;A rune&quot; style=&quot;max-width: 250px&quot; /&gt; &lt;img src=&quot;/assets/images/rune2.png&quot; alt=&quot;A rune&quot; style=&quot;max-width: 200px&quot; /&gt; &lt;img src=&quot;/assets/images/rune3.png&quot; alt=&quot;A rune&quot; style=&quot;max-width: 320px&quot; /&gt; &lt;img src=&quot;/assets/images/rune5.png&quot; alt=&quot;A rune&quot; style=&quot;max-width: 250px&quot; /&gt;&lt;/p&gt;</content><author><name></name></author><category term="writing"/><summary type="html"></summary></entry><entry><title type="html">On Tools</title><link href="https://sambleckley.com/writing/on-tools.html" rel="alternate" type="text/html" title="On Tools"/><published>2024-11-19T00:00:00-05:00</published><updated>2024-11-19T00:00:00-05:00</updated><id>https://sambleckley.com/writing/on-tools</id><content type="html" xml:base="https://sambleckley.com/writing/on-tools.html">&lt;p&gt;There are names “tools” many different contexts; there are tools for physical work, tools for mental work, for office work, for technical work, and for teamwork. There are so many things in our lives called “tools” that damaging ideas sometimes sneak in wearing a tool-shaped trenchcoat.&lt;/p&gt; &lt;p&gt;To be a real tool, it must accomplish one of these three transformations:&lt;/p&gt; &lt;ul&gt; &lt;li&gt;Transform work that required many people into work that requires fewer. These are tools like a skid steer (replacing a team of hand-shovelers), or a design system (reducing the custom per-feature design load). In military contexts this kind of tool is called a “force multiplier” — if what required a force of 10 people now only requires 5, the tool is a 2x force multiplier.&lt;/li&gt; &lt;li&gt;Transform &lt;em&gt;hard&lt;/em&gt; work for one person into &lt;em&gt;easy&lt;/em&gt; work for one person. These are tools like a chainsaw, or a targeted single-use library. They’re not quite force multipliers, because a task can never use half a person; but a hard task leaves you exhausted, and an easy task leaves you ready for more.&lt;/li&gt; &lt;li&gt;Transform impossible work into possible work. These are tools like calculus, a welder, or Unreal Engine. Before Leibniz and Newton invented calculus, there were problems which were impossible to solve; now they can be solved by a motivated high schooler.&lt;/li&gt; &lt;/ul&gt; &lt;p&gt;Every one of these tools requires initial expense, education, and ongoing maintenance; they are worth the cost &lt;em&gt;because&lt;/em&gt; they are tools by one of the above three definitions. Things that look and sound like tools but don’t actually make hard work easier are not tools, they’re mistakes. &lt;strong&gt;Don’t introduce complications to your team unless they’re actually tools.&lt;/strong&gt;&lt;/p&gt;</content><author><name></name></author><summary type="html">There are names “tools” many different contexts; there are tools for physical work, tools for mental work, for office work, for technical work, and for teamwork. There are so many things in our lives called “tools” that damaging ideas sometimes sneak in wearing a tool-shaped trenchcoat.</summary></entry><entry><title type="html">The Anatomist’s Fallacy</title><link href="https://sambleckley.com/writing/the-anatomists-fallacy.html" rel="alternate" type="text/html" title="The Anatomist’s Fallacy"/><published>2024-10-29T00:00:00-04:00</published><updated>2024-10-29T00:00:00-04:00</updated><id>https://sambleckley.com/writing/the-anatomists-fallacy</id><content type="html" xml:base="https://sambleckley.com/writing/the-anatomists-fallacy.html">&lt;p&gt;I tend to be shy about using the words “purpose” and “intention”. There are lots of processes that make it tempting to ascribe intention, but which have none. I think the easiest example is evolution. “Wide-spread eyes evolved in prey animals in order to detect stalking predators,” is an easy thought to have, but evolution as a process doesn’t plan; nothing is “in order to”. Everything is “continues because”. A more awkward but precise statement is “wider-spread eyes persist in prey animals in part because they make animals more likely to detect stalking predators and survive to reproduce, but they may provide other advantages, or just provide no disadvantages.”&lt;/p&gt; &lt;p&gt;I call this “the anatomists fallacy” — ascribing purpose to something simply because it’s effective at something, even when it’s the result of a system that might not be capable of intention at all; even when it has dozens of effects, not just the one.&lt;/p&gt; &lt;p&gt;The corrected statement is a mouthful, though, and I can understand why we hear the former more often. When you’re not just observing, though, when you’re trying to &lt;em&gt;manipulate&lt;/em&gt; a complex system, it’s important to think and speak very clearly about how it works.&lt;/p&gt; &lt;p&gt;And natural processes, like evolution, are not the only ones where this applies.&lt;/p&gt; &lt;p&gt;In my eyes, there is a spectrum of intention-having ability. Natural processes obviously have none. A sunset isn’t &lt;em&gt;for&lt;/em&gt; anything. On the other hand, when I am alone in a room, writing this paragraph, I have a fairly well-defined set of objectives, things I want to communicate, results I want to obtain by writing. If we couldn’t describe the conscious and planned actions of a single person as having purpose and intention, then the words wouldn’t have any use at all.&lt;/p&gt; &lt;p&gt;But even human systems, once &lt;em&gt;enough&lt;/em&gt; humans are involved, start to feel similar: the output of big complex systems like corporations, governments, and economies don’t feel like they can be ascribed a single purpose to me. They are the result of hundreds or thousands or millions of people, all with different purposes, acting on incentives and whims, negotiating and working together or against each other. The effects may be better described as just consequences, leaving off the leading adjectives of “intended” or “unintended”.&lt;/p&gt; &lt;p&gt;And between a single human and an entire society are smaller groups – work teams, local organizations, families, friend groups – these are entities one &lt;em&gt;can&lt;/em&gt; draw a reasonably clear line around, and which take actions and have effects. They are in some blurry area; if everyone involved is of one mind, and has communicated clearly, they could easily be acting to one purposeful end. But it’s just as possible that everyone is acting independently, and the resulting group effects are, happy or unhappy, &lt;em&gt;just&lt;/em&gt; effects.&lt;/p&gt; &lt;p&gt;So here is the practical exercise at the end of this philosophical maundering: as you go about your life this week, think about each part of it the way a careful biologist might think about a new anatomical structure — not in terms of what it’s supposed to do, but all the things it actually does.&lt;/p&gt; &lt;p&gt;Don’t ask “What is this meeting for?” — observe, and see what its effect actually &lt;em&gt;is&lt;/em&gt;. Where does information flow? Who feels what, who learns what? What’s the difference after the meeting vs before?&lt;/p&gt; &lt;p&gt;Don’t ask “Is this initiative succeeding?” — Observe what it’s doing, and &lt;em&gt;everything&lt;/em&gt; it’s doing.&lt;/p&gt; &lt;p&gt;Outside of work, consider family dinner, lunch with friends, a traffic jam, a crowded sports bar.&lt;/p&gt; &lt;p&gt;Then, when you see negative &lt;em&gt;effects&lt;/em&gt;, you can think about interventions not in terms of changing people’s minds to want different things, but in terms of nudging the entire system to produce different effects without touching the intentions involved at all.&lt;/p&gt;</content><author><name></name></author><summary type="html">I tend to be shy about using the words “purpose” and “intention”. There are lots of processes that make it tempting to ascribe intention, but which have none. I think the easiest example is evolution. “Wide-spread eyes evolved in prey animals in order to detect stalking predators,” is an easy thought to have, but evolution as a process doesn’t plan; nothing is “in order to”. Everything is “continues because”. A more awkward but precise statement is “wider-spread eyes persist in prey animals in part because they make animals more likely to detect stalking predators and survive to reproduce, but they may provide other advantages, or just provide no disadvantages.”</summary></entry><entry><title type="html">Build a Laboratory</title><link href="https://sambleckley.com/writing/build-a-laboratory.html" rel="alternate" type="text/html" title="Build a Laboratory"/><published>2024-10-07T00:00:00-04:00</published><updated>2024-10-07T00:00:00-04:00</updated><id>https://sambleckley.com/writing/build-a-laboratory</id><content type="html" xml:base="https://sambleckley.com/writing/build-a-laboratory.html">&lt;ul&gt; &lt;li&gt;A product is something someone will pay for.&lt;/li&gt; &lt;li&gt;A viable product is one you can sell for more than it costs to produce.&lt;/li&gt; &lt;li&gt;A minimum viable product is the easiest thing you can make with those properties, where “ease” depends on your specific circumstances: the time, money, resources, and skills you have available to you&lt;/li&gt; &lt;/ul&gt; &lt;p&gt;In those terms, it’s inescapable that an MVP is the best thing for your business to build. It’s nearly tautological; a business can’t exist without a viable product, and the least risky way to get there would be the one with the least cost and complexity.&lt;/p&gt; &lt;p&gt;Nevertheless, in the past year, in meetups, conferences talks, conversations with clients, and online, I’ve heard a fair amount of pushback against the MVP concept, with folks often trying to replace just one letter — recommending a “minimum viable test”, or a “minimum useful product”. And there &lt;em&gt;are&lt;/em&gt; problems: before you build it, you don’t know what people will pay! You don’t know how much it will cost! You don’t know what features it needs and which can be eschewed! Building an entire business is an expensive way to learn that your idea isn’t viable.&lt;/p&gt; &lt;p&gt;But when addressed on its own terms, the idea of an MVP feels unassailable — because it’s tautological, it’s hard to argue against.&lt;/p&gt; &lt;p&gt;In order to address that tension and frustration, I think it might be easier to approach the idea side-on, by translating the concept of the MVP into a slightly different language, where there’s more room to examine it. I have found a lot of value in an experimental mindset: make a hypothesis, find a way to test it, adjust, repeat — so I instinctively turn to that framework when I need a way to think about actions and motivations.&lt;/p&gt; &lt;p&gt;Let me invent some vocabulary to fit that perspective:&lt;/p&gt; &lt;p&gt;A &lt;strong&gt;market hypothesis&lt;/strong&gt; is a guess that a sufficiency of customers will pay some amount for some good or service. (This should not be confused with the broader “efficient market hypothesis”).&lt;/p&gt; &lt;p&gt;An &lt;strong&gt;operations hypothesis&lt;/strong&gt; is a guess that you can produce a product at a given cost (in money, in labor, in time, etc).&lt;/p&gt; &lt;p&gt;A &lt;strong&gt;complete business hypothesis&lt;/strong&gt; is a matching pair of a market hypothesis and an operations hypothesis.&lt;/p&gt; &lt;p&gt;In this language, an MVP is just the simplest experiment that demonstrates the validity of a complete business hypothesis.&lt;/p&gt; &lt;p&gt;Importantly, that experiment may still be too complex, too expensive, or too risky to design or implement straight away. And that’s OK — by framing it this way, we already know that our big guess is made up of smaller guesses; every complex hypothesis is made up of lesser assumptions.&lt;/p&gt; &lt;p&gt;In the original language of the MVP, an MVP is atomic — it cannot be broken into smaller bits without losing its fundamental properties, because it’s already, by name and definition, “minimal”.&lt;/p&gt; &lt;p&gt;But in the experimental mindset, we can envision sub-hypotheses — the assumptions from which our larger guesses are built — and we can design and run experiments to test those sub-hypotheses. A proof of your complete business hypothesis is not the least thing you can do; it’s simply the least you can do &lt;em&gt;to be home free&lt;/em&gt;. Those smaller experiments along the way aren’t viable products, or even necessarily products at all — if what you’re testing is an underlying assumption in your operations hypothesis, you may not even need to put it in front of users.&lt;/p&gt; &lt;p&gt;The &lt;em&gt;end goal&lt;/em&gt; is still always to develop and prove a complete business hypothesis; it’s still just as tautologically true that you’re not a successful business until you have a viable product. Because we’re testing a hypothesis, a disproof is as genuine an outcome as a proof.&lt;/p&gt; &lt;p&gt;User interviews, user tests, market tests, prototypes — all the tools we’re already familiar with can be cast in this experimental light, as ways of testing sub-hypotheses. If an interview or a mockup can cheaply disprove one small assumption on which you based your larger business hypotheses, then you’ve cheaply disproved the whole thing; you know you need to make adjustments without having spent the cost of an entire MVP. (This isn’t a new conclusion, only a new lens: translated back into startup-speak, small, low-risk disproofs are covered by the mantra “fail fast”).&lt;/p&gt; &lt;p&gt;And that is part of the value of this way of thinking: it’s clear that our hypothesis, like most guesses, is &lt;em&gt;probably&lt;/em&gt; wrong; rather than trying to build the easiest thing that is a successful business — rather than “build an MVP” — we want to discover a successful business in the easiest way — we want to &lt;em&gt;develop a correct business hypothesis&lt;/em&gt; as simply as possible.&lt;/p&gt; &lt;p&gt;Once we acknowledge that our first experiment won’t be the last, the sensible thing to do is to plan for many experiments. A smart inventor doesn’t build a new workshop for every prototype; a smart scientist doesn’t buy all new equipment for every experiment.&lt;/p&gt; &lt;p&gt;So don’t build an MVP; build a laboratory.&lt;/p&gt; &lt;p&gt;Don’t build a product; build yourself tools that will allow you to keep testing and improving hypotheses until you find ones that work.&lt;/p&gt; &lt;p&gt;A lot of my clients balk at the idea of building a componentized design system; but if a slick brand or elegant UI are part of your market hypothesis, then a design system is a UI laboratory. Custom-built one-off UIs are useful only for a single experiment; a design system turns your product into an optical table, where you can stage a thousand experiments using the same building blocks.&lt;/p&gt; &lt;p&gt;A lot of my clients, especially those with small teams don’t want to spend time and energy on robust feature flagging; it doesn’t feel like progress the same way that actually building features does. But being able to add, remove, and recombine entire features turns the product into an inventor’s workbench. It seems too expensive for an MVP, but it’s an obvious step for cheaply running a series of experiments.&lt;/p&gt; &lt;p&gt;(Note that I’m explicitly &lt;em&gt;not&lt;/em&gt; talking about A/B testing when I am talking about “experiments” — that is a kind of experiment, but it is more useful for refining and improving an already successful business hypothesis. A/B testing can extract a weak signal from a noisy channel, which is great when you’ve got a real business already, but tight margins and high volume. The gap between a disproven complete business hypothesis and a proven one, on the other hand, is usually visible to the naked eye. So when I say “build a laboratory” I don’t mean “construct a set of rigorous statistical tools like giant enterprises use”; I mean “provide yourself with building blocks that are easy to assemble in lots of ways, and a platform for exposing the results to users where you can watch what happens”. This is part of the difference between a “scientific” mindset and an “experimental” one — I am advocating for the latter.)&lt;/p&gt; &lt;p&gt;If this way of thinking seems appealing, here are some questions to ask yourself:&lt;/p&gt; &lt;ul&gt; &lt;li&gt;What are some ways in which your current project is like a laboratory? What experiments have you anticipated and left room for?&lt;/li&gt; &lt;li&gt;What are some ways in which your current project is a one-off? If it turns out to be unsuccessful, what will you have to throw away?&lt;/li&gt; &lt;li&gt;Reframe your business as a complete business hypothesis; how would you say it? What are the most vulnerable assumptions underlying that hypothesis? Are you testing those assumptions in the easiest way? Or are you building the machinery of a business while leaving the assumptions untested?&lt;/li&gt; &lt;/ul&gt;</content><author><name></name></author><category term="software"/><summary type="html">A product is something someone will pay for. A viable product is one you can sell for more than it costs to produce. A minimum viable product is the easiest thing you can make with those properties, where “ease” depends on your specific circumstances: the time, money, resources, and skills you have available to you</summary></entry><entry><title type="html">AI in 2024: Eat the fruit, leave the rind</title><link href="https://sambleckley.com/writing/AI-2024-III-stop-generating.html" rel="alternate" type="text/html" title="AI in 2024: Eat the fruit, leave the rind"/><published>2024-08-31T00:00:00-04:00</published><updated>2024-08-31T00:00:00-04:00</updated><id>https://sambleckley.com/writing/AI-2024-III-stop-generating</id><content type="html" xml:base="https://sambleckley.com/writing/AI-2024-III-stop-generating.html">&lt;p&gt;In my opinion, the most exciting uses of LLMs don’t involve generating text at all.&lt;/p&gt; &lt;p&gt;As one step of guessing what the next word of a text might be, existing LLMs turn all the previous text into a set of numbers.&lt;/p&gt; &lt;p&gt;There are the same number of numbers, no matter how short or long the text (with a strict upper limit of the context window). If 5 words produce 50 numbers, 500 words also produce 50 numbers – a different 50, but still 50 of ‘em. These numbers are called a ‘vector embedding’ of that text.&lt;/p&gt; &lt;p&gt;You can measure how similar or how distant those vector embeddings are, and that corresponds well to how similar or different the texts are – not just in a basic “how many words do they share” kind of way, but in a “do they mean the same thing; are they about the same topic” kind of way.&lt;/p&gt; &lt;p&gt;Multi-modal models can embed text, images, and other kinds of data all in the same space, allowing that measurement between very disparate sources.&lt;/p&gt; &lt;p&gt;That’s really cool! That’s a big deal! The ability to pick out related documents, to search by meaning instead of by word, to cluster documents in high-dimensional space… that’s an incredible, exciting tool, and one that can be used disconnected from generation entirely.&lt;/p&gt; &lt;p&gt;Most tutorials suggest using vector embeddings as a search engine, concealed behind retrieval augmented generation – but I say, why bother with the generative part? A more effective search is the win; allowing a generative process to rephrase the results is mostly risk with little to no upside.&lt;/p&gt; &lt;ul&gt; &lt;li&gt;Vector embeddings can’t produce false text, because they only organize the text you give them.&lt;/li&gt; &lt;li&gt;Vector embeddings can’t secretly produce copyrighted content because they don’t produce content.&lt;/li&gt; &lt;li&gt;Vector embeddings can still help your users find the answers they’re looking for, but you have complete control over the wording of the results&lt;/li&gt; &lt;li&gt;They have much lower energy costs than generation&lt;/li&gt; &lt;/ul&gt; &lt;p&gt;Every major LLM provider also offers an API for vector embeddings, and they’re cheap as dirt because they require far less compute than generation.&lt;/p&gt; &lt;p&gt;And that’s it for part III! You’ve now got a pretty complete picture of my opinions about large-model AI in 2024:&lt;/p&gt; &lt;ul&gt; &lt;li&gt;&lt;a href=&quot;/writing/AI-2024-I-bullshit.html&quot;&gt;only use generation when bullshit is acceptable&lt;/a&gt;&lt;/li&gt; &lt;li&gt;&lt;a href=&quot;/writing/AI-2024-II-arguments.html&quot;&gt;know which arguments against it have weight&lt;/a&gt;, and&lt;/li&gt; &lt;li&gt;vector embedding is cheap, low-risk, and underutilized&lt;/li&gt; &lt;/ul&gt; &lt;p&gt;If you’d like to discuss how any of these thoughts relate to your team, product, and business, reach out!&lt;/p&gt;</content><author><name></name></author><category term="software"/><summary type="html">In my opinion, the most exciting uses of LLMs don’t involve generating text at all.</summary></entry><entry><title type="html">AI in 2024: Making a Case</title><link href="https://sambleckley.com/writing/AI-2024-II-arguments.html" rel="alternate" type="text/html" title="AI in 2024: Making a Case"/><published>2024-08-26T00:00:00-04:00</published><updated>2024-08-26T00:00:00-04:00</updated><id>https://sambleckley.com/writing/AI-2024-II-arguments</id><content type="html" xml:base="https://sambleckley.com/writing/AI-2024-II-arguments.html">&lt;p&gt;I hang out with writers and artists along with technical people, and the conversation frequently turns to generative AI, both the text and image kind. The sentiment is rarely positive, and I am sympathetic. I think generative AI is overhyped, the wrong choice for many situations, and is already costing people their livelihoods.&lt;/p&gt; &lt;p&gt;I am a pedant, though, so while I am sympathetic, I get frustrated when I hear that negative sentiment carried by specious and ineffective arguments.&lt;/p&gt; &lt;p&gt;My favorite reason to be cautious is the one I gave last week — &lt;a href=&quot;https://sambleckley.com/writing/AI-2024-I-bullshit.html&quot;&gt;What Good is Bullshit?&lt;/a&gt; — but today I thought I’d take on some other ways people have of saying “generative AI is a problem”, and fine-tune those arguments to align with the truth as best I understand it.&lt;/p&gt; &lt;p&gt;Tired of this relentless negativity? In Part III, I’ll change my tune and talk about some of the LLM use cases I’m most excited about (mostly ignoring the generative aspect entirely).&lt;/p&gt; &lt;h2 id=&quot;llms-cost-too-much-energy&quot;&gt;“LLMs Cost Too Much Energy”&lt;/h2&gt; &lt;p&gt;This is, I think, mostly a hold-over argument from fighting against crypto. LLMs &lt;em&gt;do&lt;/em&gt; cost energy: training, especially, is expensive, and individual queries are certainly more costly than an average web query; but scary articles about the immense increase in energy usage by major players don’t seem to hold up to scrutiny. While specific energy figures for the major LLMs are largely unavailable, here’s Google’s total energy consumption over the last decade; can you spot the moment Gemini started having its dramatic and damaging effect? (Remember that ChatGPT was released in 2022)&lt;/p&gt; &lt;p&gt;&lt;img src=&quot;/assets/images/google_energy.png&quot; alt=&quot;Google Energy Usage&quot; /&gt;&lt;/p&gt; &lt;p&gt;I can’t either.&lt;/p&gt; &lt;p&gt;Too, crypto was doing things that we could already do another way, much faster and much cheaper, with a central database; LLMs are doing something computers couldn’t do before. It’s perfectly fair to argue we don’t need that thing, but it’s a weaker argument, and hard to make the data speak loudly enough for this to be an effective deterrent, I think.&lt;/p&gt; &lt;h2 id=&quot;training-generative-models-on-scraped-data-is-theft&quot;&gt;“Training generative models on scraped data is theft.”&lt;/h2&gt; &lt;p&gt;&lt;strong&gt;Note from Feb 12, 2024: If &lt;a href=&quot;https://storage.courtlistener.com/recap/gov.uscourts.ded.72109/gov.uscourts.ded.72109.770.0.pdf&quot;&gt;Judge Bibas’ judgement&lt;/a&gt; holds, this section is inaccurate — a direct commercial impact outweighs the fact that the trained work doesn’t appear in the delivered result.&lt;/strong&gt;&lt;/p&gt; &lt;p&gt;This has a good argument at its core, but it’s almost always phrased in a way that I think is morally complicated and potentially legally unsuccessful.&lt;/p&gt; &lt;p&gt;I’ll pursue the legal argument in the US because that’s the law I’m most familiar with, but I believe a similar framework would apply to any attempt at allowing or banning AI training on open web data in the EU, too. Copyright law is complex, the existing precedent can seem contradictory and confusing, and I’m not a lawyer let alone an international intellectual property lawyer, so take all of this with a grain of salt.&lt;/p&gt; &lt;p&gt;In both cases, the only way an action can legally be theft is if it is by distributing a reproduction. The act of scraping published work and storing it in a private database is not, as I understand it, an infringement. If an article is openly published, you’re allowed to save that article and read it later — in fact &lt;em&gt;just by reading it&lt;/em&gt; a copy was made, stored on your computer, and displayed. You’re allowed to print it out, to cut it up, to lick it, whatever. What you’re not allowed to do is &lt;em&gt;sell access to your copy&lt;/em&gt;, or make a billion copies to give away: &lt;em&gt;distribution&lt;/em&gt; is an infringement.&lt;/p&gt; &lt;p&gt;The question of theft, then, is not in the &lt;em&gt;training&lt;/em&gt; process at all; I am not a lawyer, but I’m pretty sure doing even very complicated statistics on open-web data is allowed. The question is whether the trained LLM “contains” the original copyrighted work, in whole or in part; and whether whatever gets distributed (the entire model or merely its output) will contain that data.&lt;/p&gt; &lt;p&gt;The answer is “yes, probably, maybe.” It’s reasonably easy to prove that at least some training data is ‘memorized’ by the model – not just analyzed and abstracted but memorized verbatim. The major players, like OpenAI, are doing their best to add layers on top of the LLM to prevent simple exposure of copyrighted training data – it’s easy to run into a “against terms of service” warning if you try obvious ways of prompting for such things – but the data is there, and can be exposed in more creative ways that don’t involve direct prompting at all: see &lt;a href=&quot;https://systems.cs.columbia.edu/private-systems-class/papers/Carlini2023Scalable.pdf&quot;&gt;this paper&lt;/a&gt; from late last year.&lt;/p&gt; &lt;p&gt;Because these methods produce &lt;em&gt;random&lt;/em&gt; memorized data, rather than specific info, it might be hard to find someone with standing to sue; but in the US, a lawsuit about this would mean a fair use factor analysis, and the ‘market impact’ factor is as large as I think it’s possible to imagine a derivative work having.&lt;/p&gt; &lt;p&gt;I understand that this is a disappointing version of the argument — what creators want is to prevent the training at all, not to allow distribution of a version of the LLM with very careful shackles — but if you made me place a bet on a legal argument working in court, it would be this one. If you can provide court-approved proof that the work is present in the model verbatim, the simplest way for an LLM-builder to prove that it &lt;em&gt;cannot&lt;/em&gt; appear in the output is to allow opting out as input.&lt;/p&gt; &lt;p&gt;So I’d rephrase the argument this way: Generative models memorize and reproduce copyrighted material, which is illegal; and they do so in unexpected ways, so the user might not even know the response contains plagiarized material.&lt;/p&gt; &lt;h2 id=&quot;generative-ai-cant-be-creative-it-can-only-regurgitate-old-ideas-or-its-just-autocomplete&quot;&gt;“Generative AI can’t be creative, it can only regurgitate old ideas” or “It’s just autocomplete”&lt;/h2&gt; &lt;p&gt;This is just straightforwardly a bad take. It’s &lt;em&gt;trivial&lt;/em&gt; to get LLMs to generate new ideas. Creativity is easy to extract from randomness; it’s more technically impressive that generative AI can be uncreative, and produce trope-ridden schlock.&lt;/p&gt; &lt;p&gt;Try prompting an LLM with “invent and briefly describe a new art movement” or “describe an unusual artwork made by an imaginary artist”. The results are not necessarily thrilling artistic achievements, and I have no fear of LLMs taking over the conceptual art scene unaided, but they’re not descriptions of anything that exists.&lt;/p&gt; &lt;p&gt;Or, more prosaically, “write a sentence that has never been uttered before”.&lt;/p&gt; &lt;p&gt;You can make an argument about the &lt;em&gt;quality, value, and validity&lt;/em&gt; of the ideas an LLM generates, but even shuffling a deck of index cards can generate new ideas from old ones.&lt;/p&gt; &lt;h2 id=&quot;using-it-is-lazy-or-it-takes-no-skill&quot;&gt;“Using it is lazy” or “it takes no skill”&lt;/h2&gt; &lt;p&gt;This is an &lt;em&gt;old&lt;/em&gt; take. Many clever tools are &lt;em&gt;most often&lt;/em&gt; used in lazy ways; but it is more morally sound to criticize the laziness, not the tool. A camera is a lazy way of making a picture — except when it’s not.&lt;/p&gt; &lt;p&gt;The LLM doesn’t determine how much or how little effort goes into the work around the act of generation; the user does.&lt;/p&gt; &lt;p&gt;I &lt;em&gt;hate&lt;/em&gt; listening to people talk about using LLMs to write entire stories. But I have no objections to someone using an LLM to find 5 more ways of expressing a sentence when their first two tries didn’t work. Mechanically producing infinite variations on a single theme is something artists have been doing for ages, with whatever technology was available to do so. If the other objections can be dealt with, there’s no reason creative people can’t spend just as much sweat and blood with this in their toolbox as without it.&lt;/p&gt; &lt;h2 id=&quot;generative-ai-will-cost-us-jobs&quot;&gt;“Generative AI will cost us jobs”&lt;/h2&gt; &lt;p&gt;This is a good one! I think the only tweaking it needs is to be more immediate and particular: “Generative AI &lt;em&gt;is costing&lt;/em&gt; jobs.” The scaremongering of “art is dead, creative writers will have no jobs, and it’ll come for you next!” is maybe less honest than “copywriters are &lt;em&gt;already&lt;/em&gt; losing work; so are stock photographers and some kinds of illustrators.” An argument about the future implications is hard, and it’s easy to be wrong. An argument about what is already occurring is undeniable, with no risk of being proven wrong in a year.&lt;/p&gt; &lt;h2 id=&quot;fin&quot;&gt;fin&lt;/h2&gt; &lt;p&gt;Are there other arguments against LLM use that you think are more effective? Is your team currently building an LLM into your product, and you think it’s the right choice despite these arguments? Feel free to reach out and bend my ear.&lt;/p&gt;</content><author><name></name></author><category term="software"/><summary type="html">I hang out with writers and artists along with technical people, and the conversation frequently turns to generative AI, both the text and image kind. The sentiment is rarely positive, and I am sympathetic. I think generative AI is overhyped, the wrong choice for many situations, and is already costing people their livelihoods.</summary></entry><entry><title type="html">AI in 2024: What Good is Bullshit?</title><link href="https://sambleckley.com/writing/AI-2024-I-bullshit.html" rel="alternate" type="text/html" title="AI in 2024: What Good is Bullshit?"/><published>2024-08-16T00:00:00-04:00</published><updated>2024-08-16T00:00:00-04:00</updated><id>https://sambleckley.com/writing/AI-2024-I-bullshit</id><content type="html" xml:base="https://sambleckley.com/writing/AI-2024-I-bullshit.html">&lt;p&gt;It’s been a year since I last wrote about generative AI and its practical application, and some of my thoughts have solidified in that time, so I thought I might be due to revisit the topic. I expect this to come in three relatively short parts, of which this is part one.&lt;/p&gt; &lt;h2 id=&quot;generative-ai-is-bullshit&quot;&gt;Generative AI is Bullshit&lt;/h2&gt; &lt;p&gt;One of my favorite papers this year so far has been &lt;a href=&quot;https://link.springer.com/article/10.1007/s10676-024-09775-5&quot;&gt;Generative AI is Bullshit&lt;/a&gt;, in Ethics and Information Technology by Hicks, Humphries, and Slater. Despite its inflammatory title, it presents a straightforward and largely semantic argument that I think is very useful when thinking about potential uses of generative AI.&lt;/p&gt; &lt;p&gt;I find myself making reference to it over and over again, and I don’t want to force people to read a lengthy academic paper, so here’s a 3-point summary (the paper itself is very readable, though, and if that sort of thing is your idea of fun, click through and read it instead. If you’ve done that, or are already familiar with the paper, feel free to skip ahead to “What is bullshit good for?”)&lt;/p&gt; &lt;ol&gt; &lt;li&gt; &lt;p&gt;&lt;strong&gt;“Bullshit” as a technical term&lt;/strong&gt;&lt;/p&gt; &lt;p&gt;For the sake of this argument, “bullshit” means anything that is intended to be &lt;em&gt;plausible&lt;/em&gt; but is unconcerned with &lt;em&gt;truth&lt;/em&gt;. Something purely and purposefully false is a lie; but when it &lt;em&gt;doesn’t matter&lt;/em&gt; whether it’s true or false as long as it is believable, that’s bullshit.&lt;/p&gt; &lt;/li&gt; &lt;li&gt; &lt;p&gt;&lt;strong&gt;LLMs are explicitly trained to bullshit&lt;/strong&gt;&lt;/p&gt; &lt;p&gt;An LLM is given &lt;em&gt;all the text its trainers can get their hands on&lt;/em&gt;, true, false, fiction, non-fiction, opinion, fantasy, shitpost, diatribe, peer-reviewed academic study…&lt;/p&gt; &lt;p&gt;Training optimizes the LLM’s ability to predict the next word of that text, given some chunk of it. If the text is a complete fabrication written by a crackpot, it still predicts the next word. If the text is truth handed down by a god, it predicts the next word. If the text is a dadaist poem, it predicts the next word. It does not have the ability to say “hold on, that’s nonsense!” it can only say “it is unlikely &lt;em&gt;anyone&lt;/em&gt; would ever say that”.&lt;/p&gt; &lt;/li&gt; &lt;li&gt; &lt;p&gt;&lt;strong&gt;‘Hallucinations’ and correct answers are the same side of the same coin&lt;/strong&gt;&lt;/p&gt; &lt;p&gt;It’s so tempting to think about the times an LLM generates blatantly untrue text as somehow &lt;em&gt;different&lt;/em&gt; from when it generates true text; that we can work to fix the former and keep the latter.&lt;/p&gt; &lt;p&gt;But the same process generates both, and that process is inherently indifferent to truth. Both are bullshit!&lt;/p&gt; &lt;/li&gt; &lt;/ol&gt; &lt;h2 id=&quot;what-is-bullshit-good-for&quot;&gt;What is bullshit good for?&lt;/h2&gt; &lt;p&gt;Just because generative LLMs produce bullshit doesn’t mean they’re useless. You just need to carefully pick tasks where good, high-quality bullshit is all you need.&lt;/p&gt; &lt;p&gt;That sounds very limiting, but one key quality of bullshit is that it’s internally consistent; as soon as it contradicts itself, it loses plausibility. LLMs are not perfect bullshit generators, but a lot of their odd behavior can be understood in terms of this pressure towards consistency; very little training text, even crackpot text, outright changes its tune mid-thought.&lt;/p&gt; &lt;p&gt;Tasks that require agreement with the real world will sometimes fail catastrophically. Tasks that only require plausible consistency with the prompt are safer bets. Some examples:&lt;/p&gt; &lt;ul&gt; &lt;li&gt;Walking someone through handling a medical crisis? Hell no!&lt;/li&gt; &lt;li&gt;Explaining scientific concepts to students? No, probably dangerous.&lt;/li&gt; &lt;li&gt;Summarizing a correct but lengthy description of a scientific topic down to a couple sentences? This is on the edge of its ability; it’s much harder to make a summary that’s plausible but incorrect when the document is right there.&lt;/li&gt; &lt;li&gt;Rephrasing a correct short summary in different words, or according to some constraint? Now we’re talking!&lt;/li&gt; &lt;/ul&gt; &lt;p&gt;This is one reason why retrieval-augmented generation makes for more effective chatbots than training-based specialization: retrieving existing (and truthful) documents is a search task, and then rephrasing their contents to fit the conversation merely requires consistency, and so plausibility is often enough of a constraint to maintain that truth.&lt;/p&gt; &lt;p&gt;Is a generative LLM a good choice for my task? Questions to ask yourself:&lt;/p&gt; &lt;ul&gt; &lt;li&gt;Am I OK with producing falsehoods with some frequency? &lt;em&gt;or&lt;/em&gt;&lt;/li&gt; &lt;li&gt;Can I provide enough context that even limited internal consistency of the response with the prompt will incidentally ensure correctness? &lt;em&gt;and&lt;/em&gt;&lt;/li&gt; &lt;li&gt;Am I in total control of the prompt, or will users or other parties be able to make the prompt inconsistent, leading to poor results?&lt;/li&gt; &lt;/ul&gt; &lt;p&gt;This is considerably simpler than most of my question-based discernment tools! I suspect you can guess from the last of these questions that I’m &lt;a href=&quot;/writing/build-a-product-not-a-chatbot.html&quot;&gt;still bearish on AI-powered chatbots&lt;/a&gt;. I don’t think exposing a raw LLM to users is a sensible approach for reliability, security, expense, or user experience – even with RAG involved.&lt;/p&gt; &lt;p&gt;That’s not to say I am against the vector database portion of RAG. These questions apply only to generative uses of LLMs. It is the non-generative uses which I think are far more exciting in terms of application, and I intend to return to that subject in part III of this series.&lt;/p&gt; &lt;p&gt;The conclusion of this part, however, is that I still believe, like I did a year ago, that too many companies and individuals are relying on LLMs to produce concise, human-readable truth, when what they’re good at is producing plausible filler. LLMs don’t have to be a flash-in-the-pan fad forced into your product and then abandoned; play to their strengths, and you’ll see longer-lasting success.&lt;/p&gt;</content><author><name></name></author><category term="software"/><summary type="html">It’s been a year since I last wrote about generative AI and its practical application, and some of my thoughts have solidified in that time, so I thought I might be due to revisit the topic. I expect this to come in three relatively short parts, of which this is part one.</summary></entry></feed>