Posts about microsoft

Journalism is lossy compression

There has been much praise in human chat — Twitter — about Ted Chiang’s New Yorker piece on machine chat — ChatGPT. Because New Yorker; because Ted Chiang. He makes a clever comparison between lossy compression — how JPEGs or MP3s save a good-enough artifact of a thing, with some pieces missing and fudged to save space — and large-language models, which learn from and spit back but do not record the entire web. “Think of ChatGTP as a blurry JPEG of all the text on the Web,” he instructs. 

What strikes me about the piece is how unselfaware media are when covering technology.

For what is journalism itself but lossy compression of the world? To save space, the journalist cannot and does not save or report everything known about an issue or event, compressing what is learned into so many available inches of type. For that matter, what is a library or a museum or a curriculum but lossy compression — that which fits? What is culture but lossy compression of creativity? As Umberto Eco said, “Now more than ever, we realize that culture is made up of what remains after everything else has been forgotten.”

Chiang analogizes ChatGPT et al to a computational Xerox machine that made an error because it extrapolated one set of bits for others. Matthew Kirschenbaum quibbles:

Agreed. This reminds me of the sometimes rancorous debate between Elizabeth Eisenstein, credited as the founder of the discipline of book history, and her chief critic, Adrian Johns. Eisenstein valued fixity as a key attribute of print, its authority and thus its culture. “Typographical fixity,” she said, “is a basic prerequisite for the rapid advancement of learning.” Johns dismissed her idea of print culture, arguing that early books were not fixed and authoritative but often sloppy and wrong (which Eisenstein also said). They were both right. Early books were filled with errors and, as Eisenstein pointed out, spread disinformation. “But new forms of scurrilous gossip, erotic fantasy, idle pleasure-seeking, and freethinking were also linked” to printing, she wrote. “Like piety, pornography assumed new forms.” It took time for print to earn its reputation of uniformity, accuracy, and quality and for new institutions — editing and publishing — to imbue the form with authority. 

That is precisely the process we are witnessing now with the new technologies of the day. The problem, often, is that we — especially journalists — make assumptions and set expectations about the new based on the analog and presumptions of the old. 

Media have been making quite the fuss about ChatGPT, declaring in many a headline that Google better watch out because it could replace its Search. As we all know by now, Microsoft is adding ChatGPT to its Bing and Google is said to have stumbled in its announcements about large-language models and search last week. 

But it’s evident that the large-language models we have seen so far are not yet good for search or for factual divination; see the Stochastic Parrots paper that got Tinmit Gebru fired from Google; see also her coauthor Emily Bender’s continuing and cautionary writing on the topic. Then read David Weinberger’s Everyday Chaos, an excellent and slightly ahead of its moment explanation of what artificial intelligence, machine learning, and large language models do. They predict. They take their learnings — whether from the web or some other large set of data — and predict what might happen next or what should come next in a sequence of, say, words. (I wrote about his book here.) 

Said Weinberger: “Our new engines of prediction are able to make more accurate predictions and to make predictions in domains that we used to think were impervious to them because this new technology can handle far more data, constrained by fewer human expectations about how that data fits together, with more complex rules, more complex interdependencies, and more sensitivity to starting points.”

To predict the next, best word in a sequence is a different task from finding the correct answer to a math problem or verifying a factual assertion or searching for the best match to a query. This is not to say that these functions cannot be added onto large-language models as rhetorical machines. As Google and Microsoft are about to learn, these functions damned well better be bolted together before LLMs are unleashed on the world with the promise of accuracy. 

When media report on these new technologies they too often ignore underlying lessons about what they say about us. They too often set high expectations — ChatGPT can replace search! — and then delight in shooting down those expectations — ChatGPT made mistakes!

Chiang wishes ChatGPT to search and calculate and compose and when it is not good at those tasks, he all but dismisses the utility of LLMs. As a writer, he just might be engaging in wishful thinking. Here I speculate about how ChatGPT might help expand literacy and also devalue the special status of the writer in society. In my upcoming book, The Gutenberg Parenthesis (preorder here /plug), I note that it was not until a century and a half after Gutenberg that major innovation occurred with print: the invention of the essay (Montaigne), the modern novel (Cervantes), and the newspaper. We are early our progression of learning what we can do with new technologies such as large-language models. It may be too early to use them in certain circumstances (e.g., search) but it is also too early to dismiss them.

It is equally important to recognize the faults in these technologies — and the faults that they expose in us — and understand the source of each. Large-language models such as ChatGPT and Google’s LaMDA are trained on, among other things, the web, which is to say society’s sooty exhaust, carrying all the errors, mistakes, conspiracies, biases, bigotries, presumptions, and stupidities — as well as genius — of humanity online. When we blame an algorithm for exhibiting bias we should start with the realization that it is reflecting our own biases. We must fix both: the data it learns from and the underlying corruption in society’s soul. 

Chiang’s story is lossy in that he quotes and cites none of the many scientists, researchers, and philosophers who are working in the field, making it as difficult as ChatGPT does to track down the source of his logic and conclusions.

The lossiest algorithm of all is the form of story. Said Weinberger:

Why have we so insisted on turning complex histories into simple stories? Marshall McLuhan was right: the medium is the message. We shrank our ideas to fit on pages sewn in a sequence that we then glued between cardboard stops. Books are good at telling stories and bad at guiding us through knowledge that bursts out in every conceivable direction, as all knowledge does when we let it.
But now the medium of our daily experiences — the internet — has the capacity, the connections, and the engine needed to express the richly chaotic nature of the world.

In the end, Chiang prefers the web to an algorithm’s rephrasing of it. Hurrah for the web. 

We are only beginning to learn what the net can and cannot do, what is good and bad from it, what we should or should not make of it, what it reflects in us. The institutions created to grant print fixity and authority — editing and publishing — are proving inadequate to cope with the scale of speech (aka content) online. The current, temporary proprietors of the net, the platforms, are also so far not up to the task. We will need to overhaul or invent new institutions to grapple with issues of credibility and quality, to discover and recommend and nurture talent and authority. As with print, that will take time, more time than journalists have to file their next story.


 Original painting by Johannes Vermeer; transformed (pixelated) by acagastya., CC0, via Wikimedia Commons

La vita cloudy

I’ve done it. I’ve moved entirely into the cloud. The process started a year ago when I bought my Chromebook Pixel. Well, actually, it started before that, when I shifted to Gmail and Google Calendar and Google Docs and that drove me to switch from iPhones to and Android phones and tablet and then to try a Chromebook. I still owned a Mac, but it did less and less, in the end just acting as a print server for my Google Cloud Print and as a Skype machine because (1) Microsoft refuses to make Skype for Chrome and (2) Leo Laporte whined about my using Google Hangouts on This Week in Google.

But last week, my Mac died. I/O error. I/O error. OK, OK, I get the point. It’s four or five years old; not worth fixing. It so happens that the moment it died I was trying to set up a Skype talk into a conference in Las Vegas. They couldn’t do Hangouts. So I had to call in on Skype from my Nexus 7 tablet and it worked. Check off one use for the old, dead Mac.

I went through a few false starts trying to check off the other function: printing. I got a Lantronix PrintServer for Google Cloud Print but it still required me to set up printers in Google and that still required having a Mac or PC. I’m using the Lantronix but I also wanted to make this test pure: no other computer required. I got a Brother printer that was alleged to be a Google Cloud Print ready but it wasn’t really. Then I got an Epson and it worked. The Epson has a web set-up I could handle on my Chromebook, arranging to print directly to it with no middleman. It even sends scans directly to my Google Drive.

Ding, dong, my personal computer is dead. I bought my first machine, an Osborne 1, in 1981. I turned off my last one 33 years later. Leo Laporte, Gina Trapani, and I talked about this at some length at the start of This Week in Google. Now Leo’s had some fun at the expense of my Pixel, though he has come around to like his. And so I asked whether for lots of people, we’ve moved past the idea of needing to own a computer that stores data and runs applications locally.

Of course, this move still depends on what you need to do with a computer. I write — in fact, I’ve just written a 55,000-word tome about the future of journalism (betcha can’t wait for that!) using my Chomebook and Google Docs and Drive. I use the web — Chrome, of course. I communicate — everything I could need except Skype. I share. I do basic photo editing. I don’t do rigorous photo or video editing; for that, I’d still need local storage and computing. Gina says she still needs to code locally. OK, but all that, too, could change as connections speed up to gigabit speed and as remote apps and servers continue to gain power over what a personal machine could do.

We also discussed the need for a security blanket: backup. As we chatted, folks in the TWiT chatroom gave us suggestions for local hard drives and for online services such as Backupify that can backup or sync data to another service, such as Dropbox. I’ll work that out next. (In the meantime, I backed up my tome to a thumbdrive.)

So now I live in the cloud. It doesn’t really matter what device I use to get to my stuff: my Chromebook, a computer anywhere with Chrome on it, my Android phone or tablet. I still run apps, but they, like my stuff, will follow me around.

Oh, and by the way, for the first time in decades, I no longer use any Apple or Microsoft products. That’s not because I have anything against either. I just don’t need them.

Welcome to the next era of personal computing without a personal computer.

Living the Google life

I was about to sit down and write an aria of praise to living the Google life, now that I have transitioned fully from my iPhone, iPad, and Mac and functioned fully for a few months with Android, Chrome, and services from Gmail to Google Calendar to Google Now to Google Reader on my Nexus 4, Nexus 7, Chromebook and now Chromebook Pixel.

But this turns into a cautionary tale as well with the news last night that Google is killing Reader. Godogle giveth, Godogle taketh away. This is the problem of handing over one’s digital life to one company, which can fail or unilaterally kill a service users depend on. Google has the right to kill a shrinking service. But it also has a responsibility to those who depended on it and in this case to the principle of RSS and how it has opened up the web and media. I agree with Tim O’Reilly that at the minimum, Google should open-source Reader.

The killing of Reader sends an unfortunate signal about whether we can count on Google to continue other services we come to need. Note well that what drove me to Google hardware was Google’s services — and now I depend on them even more. I have relied on Gmail and especially its Priority Inbox for ages. Once I finally shifted to Google Calendar et al, I found them awkward on the iPhone and so I moved to Android to try it out; there, I stayed. When the $249 Samsung Chromebook came out, I realized that I was doing most of my work only on the web, and so I decided to try to move entirely to Google Drive and Chrome. I found both transitions surprisingly easy, including working in Drive and Gmail offline. With one small and one large exception, I haven’t touched a Microsoft application for months.

The large exception is Skype, which Microsoft happens to own now. There is no Chrome app for it. I still need Skype to be on This Week in Google. So when I last went to Europe, I had to lug both my Chromebook and my Macbook with me.

But that problem was solved last night. Thanks to a helpful Google+ user, Michael Westbay, and through Kevin Tofel and Liliputing I managed to install Ubuntu Linux on the Google Chromebook so with one button I can switch from one to the other. Insert Tarzan yell here. Skype never looked better on TWiG. See for yourself:

Now to the details. Let’s start with the Chromebook Pixel. I have a review unit from Google. I so fell in love with it that after 24 hours I ordered my own — the high-end with LTE built in, for there’s nothing better than being away from wifi and suddenly finding oneself connected to the world. The screen is magnificent, which is soothing wonder to my old and hobbled eyes. The keyboard is pure butter; I only wish I could write as smoothly as I can type now. It’s fast. The machine is solid — physically and in its operation. The battery life could be better but I’m finding it does last the full five hours.

I had been managing fine on the Samsung Chromebook. But it was tinny. The screen wasn’t gorgeous. There was too little memory, which caused web pages to refresh too often. Still, for $249, I had little basis for complaint. This is a wonderful machine for students and travelers; I’d recommend it. I took it on trips as my only machine and did fine. As long as I remembered to open and refresh Drive and offline Gmail app while I was still connected, before getting on the plane, I could work when offline. The experience certainly showed me how I could live in the browser. But I wanted a slightly better machine. Then came the Pixel; it is a vastly better machine. For me, the Samsung was the gateway drug to the Pixel.

Both machines give me more Drive storage than I could possibly use. Except for one hiccup this week, Drive works well. The only other time I’ve had to use a Microsoft product was when I had to format a work document in Word. I am not sure about writing something book-length in Drive; it’s not easy to move around a large manuscript. But those things aside, it works for most anything I need to do, even presentations.

The Pixel also runs Netflix beautifully. I need to play with more Chrome apps to edit photos and video. But I tell you truthfully that I’m now not even taking my office Mac out of the drawer. I’m living in Chrome.

I’m similarly satisfied with Android, though I wish the two would integrate more and now that both are under the same leader, I hope that will happen. That Google Now will reportedly be available in both Android and Chrome is the first substantial bridge between the two. Gmail, Calendar, Maps, Voice, Google+, and Currents all operate wonderfully on my Nexus 4 and Nexus 7.

I kept playing with the idea of trying a Note II to replace both Nexus devices. I bought an unlocked AT&T model on eBay but still haven’t actually used it, as I will probably resell it. I like the size of the Nexus 4 for everyday use. I also like reading the paper and watching Breaking Bad on the Nexus 7 when I’m riding trains and airplanes.

Getting a new machine is pretty wonderful. When I turn on a new Chromebook and sign in, all my apps, bookmarks, and preferences are loaded in a minute or two. When I switch phones, I can transfer any app (though I wish I could just replicate my last phone). I am living in an ecosystem that makes sense.

So with the not inconsiderable caveat above, I’m living in Googland and happy there. Yes, at $1,500 the Pixel is expensive, but keep in mind — justification coming — that I don’t need to buy software for it. My Nexus 7 is cheaper than an iPad. My Nexus is only $300 and it’s unlocked. So I figure I also save money. I have fewer computer hassles. I can get to my data from anywhere. Come on in. The water’s fine.

Meanwhile, I bid a fond farewell to my iLife. Like an ex-girlfriend, I loved these machines in their time. I still admire them. But I don’t miss them.

Guardian column: Micohoo vs. Gulliver

My Guardian column this week on the Microhoo search lashup:

In bringing together their search traffic, Microsoft and Yahoo are fighting an unwinnable war. Worse, they are still fighting the last war. . . .

But while they pound their little fists on Google’s shins, Google remains the unchallenged giant in the arena that really matters: advertising revenue. According to the blog Search Engine Land, Google takes almost a third of all online advertising money – $21bn a year – and it doesn’t rely just on search.

And Google is turning to the next battlefields: mobile, social media, the live web, and online tools. . . .

Yahoo can now jettison the technology resources that went into search. That’s rather sad. After all, 15 years ago, it was Yahoo that first organised the web for us. Its original ambition seems quaintly naive today: human editors cataloguing every site worth visiting and deciding which were the hot ones we should visit. Back then, we, and Yahoo, thought the web was a medium, like TV, that we experienced together. Yahoo never quite broke out of that thinking. It still treats its site as a destination we have to go to with walls around it to keep us in. It just introduced a new homepage to some fanfare. Homepages are so 1999. . . .

So, let Yahoo and Microsoft celebrate their deal. Yahoo doesn’t have as much to celebrate. It turned down acquisition offers and now it gets no cash from Microsoft. And it is surrendering its earliest competence to a competitor. Microsoft has more cause to grin. It got Yahoo’s search traffic for no cash and doesn’t have to manage the rest of the old beast.

And Google? One wonders whether it notices beyond that irritating poking at its shins. It’s too busy trying to conquer what comes next.

Ballmer kills print

In an interview with the Washington Post, Steve Ballmer goes a bit farther than even I would go killing print. But that’s the problem; that’s the way print people look at it. What he’s really saying is that delivery over IP will have so much greater advantage over delivery via one-way media. Why? Interaction. He’s right.

In the next 10 years, the whole world of media, communications and advertising are going to be turned upside down — my opinion.

Here are the premises I have. Number one, there will be no media consumption left in 10 years that is not delivered over an IP network. There will be no newspapers, no magazines that are delivered in paper form. Everything gets delivered in an electronic form.

Yeah. If it’s 14 or if it’s 8, it’s immaterial to my fundamental point. . . . If we want TV to be more interactive, you’ll deliver it over an IP network. I mean, it’s sort of funny today. My son will stay up all night basically playing Xbox Live with friends that are in various parts of the world, and yet I can’t sit there in front of the TV and have the same kind of a social interaction around my favorite basketball game or golf match. It’s just because one of these things is delivered over an IP network and the other is not. . . .

Also in the world of 10 years from now, there are going to be far more producers of content than exist today. We’ve already started to see that certainly in the online world, but we’ve just scratched the surface. . . . I always take my favorite case: I grew up in Detroit. I went to a place called Detroit Country Day School. They’ve got a great basketball team. Why can’t I sit in front of my television and watch the Country Day basketball game when I know darn well it’s being video-recorded at all times? It’s there. It’s just not easy to navigate to.

In this video, he also talks about the future of advertising. Ballmer argues that it will be hard to distinguish between communication and entertainment and that advertising, commerce, and content will all blend.