This episode is brought to you by Progressive. Most of you aren't just listening right now. You're driving, cleaning, and even exercising. But what if you could be saving money by switching to Progressive?
Drivers who save by switching save nearly $750 on average, and auto customers qualify for an average of seven discounts. Multitask right now. Quote today at Progressive.com. Progressive Casualty Insurance Company and Affiliates. National average 12-month savings of $744 by new customers surveyed who saved with Progressive between June 2022 and May 2023. Potential savings will vary. Discounts not available in all states and situations.
My Wrangler jeans from Walmart are legit my favorite go-to pants. They got that slim cut that's always fresh for going out. Hey, what's up? They're durable enough, even for my shift, and stretchy enough for when I want to kick back and chill with a movie. So basically, they can do it all, and on my budget. I mean, come on. You really can't beat all that. Shop your Wrangler pants at Walmart. Listener supported. WNYC Studios.
This is the On the Media Midweek podcast. I'm Brooke Gladstone. In the first half of the last school year, PEN America has recorded almost 900 different books pulled from library shelves across the country. The American Library Association are tracking more book bans than ever, and many of them are aimed at books with the LGBTQ plus themes.
Teachers in Florida had to cover up their bookshelves for fear of getting sanctioned or fired. Mothers for Liberty, a Florida-based conservative political group, has been campaigning fervently for book bans in U.S. public schools with some success. Not since the Daughters of the Confederacy has there been a conservative women's organization as influential as Moms for Liberty. They're taking the lead in getting books banned all over the country, are directly allied with Governor and now
and fight the woke Ron DeSantis, and they are already winning streak. As long as libraries have existed, people have tried to police what goes in them. But for some, the ideal library is not one that excludes authors, but rather a place that comprises all of them. For centuries, scientists and inventors, philosophers and programmers have been inspired to envision or even build a better library, a perfect library.
A perfect library, one that stocks every book ever written. The kind of library that may have actually once existed. Late last year, On the Media producer Molly Schwartz went to her local library to meet some of the people trying to build a universal repository of human knowledge to learn what kind of progress they've made and what keeps the dream alive. It's a gorgeous fall Saturday in Brooklyn.
mild chill in the air, colorful leaves, general good vibes, and I'm on my way to a birthday party at the Brooklyn Public Library at 9.30 in the morning. Welcome everyone to Wikidata Day. Today is Wikidata Day. It's the 10th anniversary of Wikidata. So Wikidata is sort of the data science side of Wikipedia. You know Wikipedia, the free online encyclopedia with millions of articles in hundreds of languages, all written by volunteers. Jim Henderson is one of them. And today he's wearing two hats.
Literally. This is the data hat, something I ordered when I was on the board of directors of the local club. The data hat says, I heart Q60. Q60 is Wikidata for New York City. On top of it, he's wearing a beanie that says Wikimania Cape Town. When we had our next to last Wikimania Worldwide Convention. So we have this kind of mission statement for the Wikimedia movement. James Forrester is a software engineer at the Wikimedia Foundation. Imagine a world in which all people have access to the sum of human knowledge.
providing everyone on the planet access to the sum of human knowledge. That's the prime objective of Wikipedia, as stated by co-founder Jimmy Wales. I mean, it's a mission statement, right? You're not meant to achieve them. You're meant to move towards them. And definitely we've moved a huge amount towards them in the last 20 years, pushed the ball along the road a little bit. As the day goes on, I learn about Wikidata properties and qualifiers.
I also get in a little bit of trouble because On the Media's Wikipedia page isn't up to date. Added Suzanne Gaber. Publish the changes. And that's done. We need some more links. And I spoke with someone who thought a lot about universal libraries and how they work. I grew up with the 1940s Britannica and the 1960s World Book, and I wanted to contribute to the sum of knowledge.
Richard Kneipel is the president of Wikimedia New York City, but in the world of Wikipedia, he's known by his username, Pharos. Named after the Pharos of Alexandria, the lighthouse of Alexandria. It's in homage sort of to the Library of Alexandria. Perhaps the closest thing there ever was to a universal library.
a bastion of all the world's knowledge for all who seek it. We actually had our international Wikimedia conference. We had Wikimania was in Alexandria a few years ago. And people do feel a strong cultural resonance with Library of Alexandria and other universalizing attempts at knowledge. For some reason, the Library of Alexandria has captured people's imagination. It was certainly the largest library of its era. Alex Wright is the author of the book Glut, Mastering Information Through the Ages.
He says the Library of Alexandria was built in Egypt in the 3rd century BCE, likely by decree of the pharaoh Ptolemy I. And he tried to attract as many notable scholars as he could to come and contribute to the collective enterprise of building not just a library, but a university and a center of learning. Ptolemy's mandate for the Library of Alexandria was as ambitious as it was simple. Collect everything. Every papyrus scroll, every book, every manuscript.
by force if necessary. When ships would come to Alexandria, officials would basically seize the books on the ship and add them to the library. They lifted books from private citizens, stole them from docked boats, and allegedly took books via subterfuge from Athens.
But despite the Ptolemies' best efforts, Alexandria could never really compete with Athens. Athens was this organic center of culture and learning, whereas in Alexandria, all the scholars were entirely beholden to their employer, the pharaoh. So as the Ptolemaic empire crumbled, so did the library. There's this deeply intertwined relationship between libraries and statesmen.
state or governmental power. And you find that the great libraries of the world have, not coincidentally, emerged alongside powerful empires or civilizations. What happened to the library is actually unclear. Some say Julius Caesar burned it down. Others say a conquering Muslim commander burned the books.
And others say that the library never succumbed to a fire at all, but rather to years of neglect and changing empires. We don't know for sure exactly what happened. We do know for sure that the library no longer exists.
and that the 500,000-odd volumes of material there have for the most part been lost to posterity. And yet there's something apparently kind of energizing about that ideal that has inspired a lot of people over the years to try to work towards some kind of universal repository of recorded information. Versions of Universal Library's pepper science and speculative fiction.
From Jorge Luis Borges' magical Library of Babel... The universe, which others call the library, is composed of an indefinite, perhaps infinite number of hexagonal galleries. To Isaac Asimov's Imperial Library and the Foundation series. It was just an imperial library, enchanted in the stacks. The ceiling was wooden and there were all these marble busts.
To Douglas Adams' Hitchhiker's Guide to the Galaxy. The Hitchhiker's Guide has already supplanted the great Encyclopedia Galactica as the standard repository of all knowledge and wisdom. To the TV show Doctor Who. The library. Every book ever written, whole continents of Jeffrey Archer, Bridget Jones.
Monty Python's big red book. There's even a universal library in the world of the occult, according to theosophists. The Akashic Records is a place within a different dimension. It's a higher dimensional energy that is like the library of the universe. It holds all the records of the universe and anybody can tap into this energy, into this knowledge and access it for themselves.
Around the invention of Gutenberg's printing press, the Vatican Library was also founded. According to Pope Nicholas V, the goal was ensuring, quote, "...for the common convenience of the learned, we may have a library of all books in both Latin and Greek that is worthy of the dignity of the Pope and the Apostolic See."
And then in the late 19th century... Suddenly, printing of books became an industrialized, mechanized affair. Alex Wright. And you started to see this explosion of popular literature. Magazines, what they sometimes called penny dreadfuls, these cheap little precursors to tabloids.
It was during this explosion of books that people started paying attention to how to organize and retrieve them using universal classification systems. That was when Melville Dewey invented his decimal classification. There was another guy named Charles Cutter working the Boston Athenaeum who developed a different classification system that's now used in a lot of academic libraries and
India. There was a librarian named Ranganathan who developed a really highly sophisticated method that he called faceted classification. And then in the 1930s, with huge leaps in technological progress following World War I, came another burst. This is when the visionary H.G. Wells published a series of short stories.
In them, he explained a concept called the world brain. There is no practical obstacle, whatever now, to the creation of an efficient index to all human knowledge, ideas, and achievements.
to the creation, that is, of a complete planetary memory for all mankind. Around the same time, Paul Atlet, a librarian in Belgium, envisioned the world book. Here, the workspace is no longer cluttered with any books. In their place, a screen and a telephone within reach. Over there, in an immense edifice, are all the books and information. Cinema, phonographs,
Radio, television, these instruments will in fact become the new book. And Vannevar Bush, who worked for President Truman as what was effectively the nation's first science advisor, wrote about a memory supplement that he called the Memex. The analytical machine which will supplement a man's thinking method, which will think for it,
will have as great an effect as the invention of the machine way back took the load off of men by giving them mechanical power instead of the power of their muscles.
Basically, these imagined groundbreaking gizmos are proto-computers and proto-search engines. They were all invented, and more and better. And with all of this came another round of optimism, starting in the 90s and picking up speed in the aughts in 2010s, that the dream of a universal library could be realized.
via the internet. I'm a librarian, and the idea of using technology is perfect for us. That's Brewster Kaeligan, the founder of the Internet Archive, who we heard from earlier in the show. I think we can one-up the Greeks and achieve something. Yeah, we could actually achieve the great vision of everything ever published, everything that was ever meant for distribution, available to anybody in the world that's ever wanted to have access to it.
There was a host of these projects all at the same time because people all had the same vision. Put books online. There was also the Universal Library Project, which was later largely supplanted by HathiTrust, which is this massive cache of digital content that's available to a group of research libraries.
And then there were projects ranging from the World Digital Library to the Digital Public Library of America and all kinds of smaller offshoots. Walk into the Bexar County Digital Library in San Antonio, Texas, and you'll see plenty of screens, but zero books. This doesn't look like a library. No. That's the point. That's the point. But the one that probably had the grandest vision, the most top-down, the most completest, was the Google Books Project.
Google transported truckloads of books to their massive scanning centers across America. And they got their technology good enough that they were scanning about 1,000 pages an hour. Since 2004, they've digitized over 20 million books. And they have plans to literally digitize every book in the world. Google Books was a huge deal, the company's first moonshot, the first salvo in a revolution. What is being discussed tonight is not your ordinary kind of revolution, like cars and jets.
It is a super revolution.
like writing and printing and computers. Google Books was sued by the Authors Guild for violating copyright. Ultimately, Google won, but only because they only show snippets of most books, a far cry from the initial vision of universal access. Unfortunately, I'm still using a mobile internet plan. Jule Lakatos is a software engineering consultant. He lives in a house on a hill in a small town in Hungary on the banks of the Danube.
too far from Budapest to get high-speed internet. And that's a problem because he's working on a very large project housed in two servers stashed behind him. One of them is running 20 hard drives. The other one is 10. Lakatos is using those servers to save just a little bit of today's digital flood for posterity.
inspired by deep admiration for the ancient empires of Greece and Rome, and fearful that modern civilization will suffer the same fate as those lost knowledge centers. As far as I know, only like around 1% of books and documents survived from classical antiquity. Which made him wonder, what's going to be left from life today? So I created an application suite. It's not just one, it's seven applications.
You can deploy these applications to crawl the web. He spent about $6,000 on equipment that ran his applications for two years, amassing over 90 million documents onto the servers in his house. His application suite is open source and available on GitHub, and you'll never guess what it's called. It's called Library of Alexandria. At this point, he's collecting around 2 million documents a week. They include everything from interesting, complex doctoral dissertations...
to the kind of ephemera of restaurant menus, to a weird collection of Russian passports. It's a mishmash of the valuable and long-forgotten, all hoovered up and stored in the hills of Hungary. Lakatos wants to open up his treasure trove to the public,
But for now, it all just lives on his servers because he's afraid about copyright laws. And navigating copyright laws for 90 million documents? That's a job for more than just one person. I don't really want to host it, to be honest. I'm a lazy person. I just want to search in that library. That would be a lot of fun. I came across Lakatos' project on a subreddit called Data Hoarder. It's a forum for people with a kind of unusual hobby, trying to preserve things they find on the internet.
They hoard this data in their living rooms and corners of the web, and sometimes on places where a lot of the web is hosted, Amazon Web Services or AWS. It's all to try to preserve a first draft of history for the future. Humanity in general can lose a lot of knowledge out of nowhere for no reason. Imagine that, for example, a fire is starting in one of the AWS warehouses where a lot of things are hosted and people just lose their data. Like the whole data hoarder
Subreddit is very concerned about this, and I just wanted to kind of notify people with this name a little bit, like, or warn them a little bit more. I appreciate these data hoarders. I got my degree in library science because when I stood in the rare books room at my university, I felt a sense of awe, like I'm part of a story that began long before I was born and will go on long after I'm gone. I hope. I hope.
In the glut of information, more gets lost than saved. And that's not always a bad thing. One of the first things an archivist learns is that the best way to save things is to know what to throw away. Have you ever noticed that the material on which knowledge is stored has gotten more ephemeral with time? From carved stone, to parchment, to paper, to tape and floppy disks, to drives for outdated devices, to everything stored in a cloud preyed to all kinds of terrestrial and cosmic events?
The fact is, preserving all the world's knowledge is like building a dam against the unyielding torrents of time. It's impossible. But if we don't keep trying, how will anyone know we were ever here? For On The Media, I'm Molly Schwartz. Thanks for listening to this week's Midweek Podcast. I'm Brooke Gladstone.