Project Gutenberg – keeps getting better

Hi! I'm one of the programmers at Gutenberg. We've been improving the site a lot over the past few months (and more is coming!). If you haven't visited the page recently, it's worth checking out again: https://www.gutenberg.org/

While PG has probably seen a lot of use and growth with the mainstreaming of the Internet since the 1990s, (TIL) it started back in 1971:

> Michael S. Hart began Project Gutenberg in 1971 with the digitization of the United States Declaration of Independence.[5] Hart, a student at the University of Illinois, obtained access to a Xerox Sigma V mainframe computer in the university's Materials Research Lab. […] This computer was one of the 15 nodes on ARPANET, the computer network that would become the Internet. Hart believed one day the general public would be able to access computers and decided to make works of literature available in electronic form for free. […]

* https://en.wikipedia.org/wiki/Project_Gutenberg

The best thing I ever did for my father was to buy him a kindle and an access point and show him how to use Project Gutenberg to get books. He loved the old writings (he being a GED holder who was in the Navy during Korea yet had read the entire Harvard Classics). He had a special rolled up towel he used to prop it on his lap in his favorite chair and he read and read and read. When he passed he was reading "Legends of the Jews" from 1931.

I had some small e-correspondence with Michael S. Hart back in the 90's as well, and made a few modest contributions to the project, which made my English major undergraduate heart swell with pride and joy.

I guess this is only to say that PG is special to me for these reasons, and I am glad to see it still thriving. <3

From Italy, https://www.gutenberg.org/ gives a 404 error and https://gutenberg.org/ opens a very official-looking page stating "police notice. This site is under judicial seizure" and references a sentence number: "criminal proceedings 52127/20 R.N.R.I. tribunal of Rome"

Any idea what's happening? I thought PG published public domain books...

I'm surprised no eBook Reader vendor has a Project Gutenberg "Store." Where you can just browse Gutenberg, find a book, and just grab it down to the reader. Instead, they either are actively hostile (Kindle), or require the use of Calibre (which itself is good, it is just the friction).

Nice to see so much appreciation for what we do. (I'm the new-ish executive director.) Any wikipedians reading this, the article about PG is... aging. Last I looked, it said we offered Plucker files. @Jseiko has done some nice work.

Project Gutenberg is a treasure trove, though many technical details defy automatic typesetting of its books. Standard Ebooks takes consistency to an unbelievable level. My post compares various sources of public domain books with an eye on typesetting:

https://dave.autonoma.ca/blog/2020/04/11/project-gutenberg-p...

Worth mentioning the Project Gutenberg ZIMs. You can download the entire English Gutenberg corpus for about 60GB (the English Wikipedia ZIM complete with images is ~120GB):

https://ebookfoundation.org/openzim.html

Project Gutenberg had (has?) a tendency toward plaintext that always put me off. (And it has been over a decade I'm sure since I explored the site—so I am no doubt now misinformed.)

I like a styled formatted book—would prefer PDFs. (I know, not a popular format apparently.)

I like the idea of Project Gutenberg but guess I found book scans on archive.org my preference.

My go-to example is Lewis Carroll's "Through the Looking Glass" with the fantastic art of John Tenniel and Carroll's sometimes creative formatting of the prose…

I see they (Project Gutenberg) have ePub now, which can be good if well done.

(If not well done it can be a kind of mess. Re-flowable "HTML", paginated… Anyone ever try to print a long web page and did you enjoy the result? Perhaps that is as much on the ePub reader though.)

Looks like the top downloaded book yesterday[0] was Concrete Construction: Methods and Costs by Gillette and Hill.[1] It beat out Moby Dick, The Count of Monte Cristo, Frankenstein, Romeo and Juliet, and others.

> 23644 downloads in the last 30 days.

I wonder if this is bot behavior? 23k downloads feels like a lot?

[0] https://www.gutenberg.org/browse/scores/top

[1] https://www.gutenberg.org/ebooks/24855

Gutenberg is awesome. There is also

https://www.fadedpage.com/ from Canada I think

https://runeberg.org/ from Sweden

As a Kindle user, I still miss the old version of the site. The new one looks great on normal desktop, but the old one was simple enough to load and directly download books on the device's built-in browser.

The project was geo-blocked in Germany for a long time: https://news.ycombinator.com/item?id=29024039

Keep up the awesome work!

Made an app that allows reading PG books as audiobooks on iPhone https://loudreader.io/

I remember printing out project Gutenberg books in the mid-90s, four regular pages to an A4 page, double-sided on my inkjet. I had a background in typography, so I made it work.

And yes, the text needed a lot of processing to make it right.

Now, in my early fifties and with declining eyesight, that's out of reach.

Thanks for sticking with the project!

A big pet peeve of mine with Project Gutenberg was the lack of mobile styling. Looks like it’s been fixed! Awesome.

Project Gutenberg feels like the opposite of modern internet design philosophy. Quiet, useful, accessible, and built to last.

Recently downloaded Moby Dick from here:) very easy to use

I wonder if the people behind project Gutenberg use Anna's Archive or mam for books that can't be put on Gutenberg.

I'm slightly curious how PG handles heavily illustrated books. I've downloaded some years ago, and the quality of the illustrations was always pretty poor. Has it been improved lately? What's the QA like for illustrations?

I love how usable the site is even with JS disabled!

I thought this was for the Wordpress Gutenberg Editor for a second

I find it interesting that the context of this comments page apparently overrides the normal definition of “PG” on HN.

PG remains one of the best things on the internet. The amount of fascinating material almost beggars belief.

Their feeds of new books are a goldmine:

https://www.gutenberg.org/ebooks/feeds.html

Every day you'll get much more than you're bargaining for, right into your feed or inbox. You can easily download the books you're interested in and put them on your Kindle.

Please give me some book recommendations :)

I keep getting PR_CONNECT_RESET_ERROR

Thank you for reminding me about this project. I haven't visited it in a long time.

How did "Concrete Construction: Methods and Costs" come to be the #1 download?

I can't read anymore due to fear of not being productive with AI

"Project Gutenberg began in 1971 when Michael Hart was given an operator’s account with $100,000,000 of computer time in it by the operators of the Xerox Sigma V mainframe at the Materials Research Lab at the University of Illinois."

https://www.gutenberg.org/about/background/history_and_philo...

wikipedians, please help update this article.

It was also blocked in Germany for a while due to a court order https://cand.pglaf.org/germany/index.html

Found: it's a sentence from 2020, and PG decided not to appeal (!?)

Full story (in Italian) at https://www.wired.it/internet/web/2020/06/30/progetto-gutenb...

I asked Claude to research the background story: "In May 2020, the Court of Rome ordered Italian ISPs to seize/block a list of domains as part of a criminal case (the 52127/20 R.N.R. you're seeing) targeting sites and Telegram channels distributing pirated newspapers and magazines. 28 domains were on the list, and Project Gutenberg got thrown in alongside the actual pirate sites."

apparently this situation hasn't been resolved yet

I've used https://standardebooks.org/ to pull nicely formatted Project Gutenberg books on any e-reader that supports a browser (in my case, Boox).

Technically, I can also just directly pull the epub from Project Gutenberg, but sometimes the formatting leaves a lot to be desired.

Once you get an e-reader that runs a semi-capable OS (ex - stock android, even an older version), it's hard to go back to something like a kindle.

If you don’t strip the Project Gutenberg license from the book text (leaving only the book text, which no-one disputes is public domain and freely distributable), you are required to “pay a royalty fee of 20% of the gross profits you derive from the use of Project Gutenberg-tm works calculated using the method you already use to calculate your applicable taxes”

https://www.gutenberg.org/policy/license.html

[Way back in the early days of the iPhone, I sold a book reading app which was backed directly by Project Gutenberg texts, called “Eucalyptus”. I sent 20% of the gross profits to PG - which was never less than very supportive of the app - and felt good about doing so.]

Most of them offer their own paid storefronts and have a perverse incentive not to offer a large area full of free books.

Used to be one could sort of get that with the Project Librivox:

https://librivox.org/

e-book app Gutebooks (in addition to their audio app), but it seems to have been deprecated: I'm no longer able to connect to the server on my copy (which I only got 'cause there was an in-app purchase to fund Project Librivox).

FWIW, Barnes & Noble has been plundering the public domain using a book composition/keying house in the Philippines to make their public domain books which they make available in their stores --- Amazon apparently has a similar setup for the Kindle Store:

https://www.amazon.com/Public-Domain-Books-Kindle-Store/s?k=...

Rather a shame that PG didn't monetize by putting their books up there pre-emptively.

I've heard that the newest Kobo e-readers have a browser that you could use to go to gutenberg.org and directly download files.

but yes, generally I agree with your point. Library of 75k books seems pretty valuable to have direct access to.

You can download books directly from the Project Gutenberg website using the web browser on most eBook readers - even the Kindle supports it.

We're supporting EPUB3 for the vast majority of books! At the same time we also have a "Plain Text" version for each as in a sense it's the most robust. PDFs are in the works!

As others here have mentioned, https://standardebooks.org/ is excellent and my understanding is that they use Gutenberg books as a source for theirs but done up much nicer.

I love, love, looove the fact that I can have a book's html version on project gutenberg bookmarked and continue to read across devices without ever having to login. I use the browser's inbuilt capability extensively to enhance my reading experience (fonts, backgrounds, text to speech, print formatting, share snippets). None of this is a good experience with pdf, epub or any other format.

I've read more (meaningful) text on PG than any other digital platform. Huge fan. Thanks for all the work and for keeping it clean and free

Check out Standard eBooks. They take the text from Gutenberg and add a level of polish to the ePubs.

I on the other hand prefer epubs for fiction. I mostly read on the phone.

The common issue with PDFs is that e-readers generally have terrible support for them.

PDF coming this year.

I have got quite a few books over the years from Gutenberg, and the epubs have been fine, even the illustrated ones.

I like plain text. You can always post process it into any other format you prefer.
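For anyone curious what that post-processing can look like, here is a minimal Python sketch, not anything PG itself ships. It assumes the plain-text file wraps the book between `*** START OF THE PROJECT GUTENBERG EBOOK ... ***` and `*** END OF THE PROJECT GUTENBERG EBOOK ... ***` marker lines (older files word these differently, so the patterns are deliberately loose). The helper names `extract_body` and `unwrap_paragraphs` and the sample text are made up for illustration.

```python
import re

# Assumed marker shape; real files vary, so both patterns are kept loose.
START = re.compile(r"^\*\*\* ?START OF (THE|THIS) PROJECT GUTENBERG EBOOK.*$", re.M)
END = re.compile(r"^\*\*\* ?END OF (THE|THIS) PROJECT GUTENBERG EBOOK.*$", re.M)


def extract_body(raw: str) -> str:
    """Return the text between the START and END marker lines, if both exist."""
    start, end = START.search(raw), END.search(raw)
    if start and end and start.end() < end.start():
        return raw[start.end():end.start()]
    return raw  # markers not found: leave the text untouched


def unwrap_paragraphs(body: str) -> list[str]:
    """Join hard-wrapped lines so each paragraph becomes a single line."""
    blocks = re.split(r"\n\s*\n", body.strip())
    return [" ".join(block.split()) for block in blocks if block.strip()]


# Tiny invented sample standing in for a downloaded .txt file.
sample = (
    "Header text\n"
    "*** START OF THE PROJECT GUTENBERG EBOOK EXAMPLE ***\n"
    "\n"
    "Call me\nIshmael.\n"
    "\n"
    "Some years ago,\nnever mind how long.\n"
    "\n"
    "*** END OF THE PROJECT GUTENBERG EBOOK EXAMPLE ***\n"
    "License text\n"
)
paragraphs = unwrap_paragraphs(extract_body(sample))
print(paragraphs)
```

Once each paragraph is a single string, feeding it to whatever you prefer (HTML, LaTeX, a typesetting tool) is the easy part. Verse, tables, and indented block quotes would need gentler handling than this blanket unwrap.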

Haha well there is an exciting movie about concrete coming out, “The History of Concrete” by John Wilson. Surely the superfans are studying up

bot traffic would be my guess too. I doubt there was a sudden global spike in interest in "Concrete Construction Methods" :D

That's interesting. What about the new design prevents you from doing it? Genuinely asking here. We may fix it if it's actionable

Is that a Kindle issue?

You can download books in most browsers. I know Amazon have done things to make life difficult for other stores in the past.

Project Gesperrtberg

very glad this has been resolved (I'm from Germany myself)

that's cool! one of my "pet-ideas" is actually to make an AI-agent that does all that typographical work for any PG book to make it nicely printable without any manual labor whatsoever. Maybe that's doable now ...

good to hear - that was a lot of work!

Moby Dick is consistently one of the Top Downloads

Nowadays we depend on scans from Internet Archive, HathiTrust, and other sources. Some scans are better than others. Bear in mind that our illustrations need to be in the public domain and usually from the same edition as the text. https://www.gutenberg.org/help/errata.html

the amount of weird/interesting stuff that one would find nowhere else is possibly the coolest aspect of PG imo

I used to use the Online Books Page new books listing similarly:

https://onlinebooks.library.upenn.edu/new.html

Flatland: https://www.gutenberg.org/ebooks/search/?query=flatland

I've heard good things. Also - Sherlock Holmes :)

Not a recommendation per se but I used to use Amphetype on Gutenberg texts to practise touch-typing. There's something about writing out a book that hits differently to reading it. You skip less, odd parts stick with you. I think the last one I tried was The Island of Dr Moreau.

From the newest releases page I stumbled into "Some Nigerian fertility cults" by Percy Amaury Talbot & am enjoying it so far.

https://www.gutenberg.org/ebooks/78684

Have you considered having a detailed version history for each book (etext)? The process of submitting fixes to typos etc in books involves sending an email (https://www.gutenberg.org/help/errata.html) and although the last time I did this (2011) the fixes did get applied reasonably quickly (couple of days), it all felt a bit opaque. The version history could also include the project (usually PGDP correct?) the etext originated from; that way one would be able to compare against the actual page scans.

I have very mixed feelings about Standard Ebooks and would much prefer being able to use Project Gutenberg directly, but one good thing Standard Ebooks does is that every book has an associated git repository (on GitHub), so it's (in principle) possible to see a history of fixes to the text over time.

As long as you're taking suggestions, since many of the books are quite old, adding a publication date or date range to the search functionality might be nice. I personally would find it very useful since I have a tendency to look for things that are older than year _x_ when researching various things.

Thanks for all the effort put into the site!

When I thought about Project Gutenberg I remembered that original brutalist non-design. The current site has been very tastefully updated but looks like it's still very accessible if you turn styles off. Great job!

Huh that's interesting: 4.5 seconds for the TCP handshake and an additional 9.2 seconds for the TLS handshake. Is this some kind of captcha, since most bots would disconnect before that, so if you complete it once then it knows you're good? (Until the bots catch on of course, but so long as it works it's relatively unintrusive and not discriminatory against uncommon client software (that is, non-Chrome/ium).) The rest of the requests were lightning fast
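If anyone wants to reproduce timings like these, below is a rough Python sketch using only the standard library. It is one assumed way to split the numbers, not how the figures above were measured: the function name `handshake_times` is invented here, and the "TCP" figure also includes DNS resolution because `socket.create_connection` resolves the name itself.

```python
import socket
import ssl
import time


def handshake_times(host: str, port: int = 443, timeout: float = 30.0) -> tuple[float, float]:
    """Return (tcp_seconds, tls_seconds) for one fresh connection to host:port."""
    t0 = time.perf_counter()
    # Includes DNS lookup plus the TCP three-way handshake.
    sock = socket.create_connection((host, port), timeout=timeout)
    t1 = time.perf_counter()
    context = ssl.create_default_context()
    try:
        # wrap_socket performs the TLS handshake on an already-connected socket.
        tls_sock = context.wrap_socket(sock, server_hostname=host)
    except Exception:
        sock.close()
        raise
    t2 = time.perf_counter()
    tls_sock.close()
    return t1 - t0, t2 - t1
```

Calling `handshake_times("www.gutenberg.org")` a few times in a row would show whether the delay only hits the first connection or every one, which is the distinction the captcha theory hinges on.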

Edit: welcome to your first comment after 9 years on HN btw, nice to have you here!

The book list elements on front page render as both horizontally and vertically scrollable divs on mobile - seems like an opportunity for improvement.

Keep up the good work!

Thank you for your work. This site is an international treasure.

Thanks for the free work! Project Gutenberg is nice to have :).

On the site I noticed the library boxes have roughly a single extra line causing a scrollbar to appear and the last line to be chopped off https://i.imgur.com/PQ8T0qc.png is there an issues/bug portal to properly submit these kinds of things?

Thank you for being one of the best places on the internet

Oh, my! This does look nice. Thank you for your hard work!

There's a minor bug with Chrome on Android where the menu will not close when you tap outside the menu or on the menu link/button

It's got better reviews on Goodreads than Moby Dick too. I know what I'm reading next

And now it's time to put my foot in my mouth. I haven't used it in a while because it was frustrating, but you guys seem to have already fixed it :)

The previous version of the site had two major flaws:

1. The search bar had been removed from the top of the page, and hidden behind a "Click here to search" (or similar) link partway down the page

2. Once you opened that page, the coloring of the site was so washed out on e-ink that the text input was hard to find.

Thanks for fixing it!

I'd call it one of those middle-ground things:

• On the one hand, E Ink devices have a fairly known set of limitations, and it would be ridiculous for me to expect them to render the whole web well.

• On the other hand, it's good for website designs to consider the kind of devices employed by their users. Using a Kindle to access Gutenberg is likely less of an edge case than it would be for other sites, so it's worth the extra design work.

(Keep in mind that -- given my sibling comment -- this is all theoretical. The latest iteration of Gutenberg's site is much better than the previous version)

That is doable. Most of my work was regexp and repetitive stuff. And the typography stuff is achievable with the current state of the art models. Not that I remember what I did, it was 30 years ago.

maybe there's a way to read more productively using AI: https://x.com/karpathy/status/1990577951671509438

could be a trick to ease that fear :D

just heard back that the server provider has been doing a security update. Maybe you were one of the users that got unlucky as a result... maybe try later if still interested

good question. first thought - maybe some bot has downloaded it often for whatever reason and our systems didn't detect it as bot traffic. just a guess.

We're using git repos internally to keep history for each book. They existed on GitHub for a while, but our implementation was awkward, and too big a project for the volunteer dev team. But it's likely that we'll evolve towards that.

I believe our new-ish CEO Eric Hellman actually did some work on something very similar

That's an interesting idea. not a small feat to accomplish though ...

sadly HN doesn't have a "heart" emoji I could use :D

I think their site is just slow, potentially because more people than they are used to are trying to view it.

I was unable to load it initially (got an error from firefox) and had to re-attempt. Still slow if one forces a reload (shift-r, etc, to not use local cache).

we are having occasional lows in page speed performance due to LARGE amounts of bot traffic. full disclosure - we've not really been able to resolve this fully/well. Let us know if you have a good idea for how to deal with it

good feedback thanks! Doing an iteration on the homepage design is actually pretty high on the priority list. will keep your feedback in mind!

Thanks! We're currently working on a design update of the page of any specific book. Should be online soon (next 1-2 weeks or so)

I've messaged the guy who's best suited to fixing this. He'll be on it this weekend

will open an "Issue" for it

Maybe include a "Lite" version that only displays text/links? No to minimal styling would be great!

personally I'm a fan of the other "PG" as well.

Ulnar Nerve Entrapment :/

I've found that the larger open-weight AI models do a great job of explaining the old non-fiction content on PG, particularly magazine articles which are a good size for the AI to handle. It breaks down the long wall-of-text paragraphs for you and explains all the historically relevant background that would've been assumed to be known back in the day.

If you ask it to assess the relevance of the text in the present day it will also do that very nicely, highlighting the places where the text shows old-fashioned viewpoints that would be sharply criticized today.

Less than three is a classic!

Do you host a torrent?

I have about 50k of the books, I would have used a torrent of just the txt files if it was prominent.

I'm only a small-scale sysadmin but the way that I understand the internet is that you send abuse notifications to the IP address block owner and, if it doesn't get resolved, you block. The whois/rdap database reveals which IPs all belong to the same hosting provider or ISP, so you can summarize that all to one list of IP addrs + timestamps per some time period

The ISP actually knows which subscriber is on that line, can send them notices, block them, terminate them... loads of things that you simply cannot do because you have no relation to this person. And frankly I wouldn't want to need to have a personal relation with every website that I visit; my ISP can reach me if there is anything relevant to continued use of the internet. From personal experience, when I was a teenager, the ISP cutting our household off after an abuse report was an effective way of stopping what I was doing
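To make the "one list of IP addrs + timestamps per owner" idea concrete, here is a small Python sketch. Everything in it is illustrative: the `NETBLOCKS` table stands in for what a whois/RDAP lookup would return, the owner names are invented, and the addresses come from the reserved documentation ranges, not from any real log.

```python
import ipaddress
from collections import defaultdict

# Hypothetical netblocks; in practice these would come from whois/RDAP lookups.
NETBLOCKS = {
    "ExampleHost": ipaddress.ip_network("203.0.113.0/24"),
    "ExampleISP": ipaddress.ip_network("198.51.100.0/24"),
}


def group_by_owner(hits):
    """Map owner name -> sorted list of (timestamp, ip) for hits inside its block."""
    report = defaultdict(list)
    for timestamp, ip_text in hits:
        ip = ipaddress.ip_address(ip_text)
        owner = next((name for name, net in NETBLOCKS.items() if ip in net), "unknown")
        report[owner].append((timestamp, ip_text))
    return {owner: sorted(rows) for owner, rows in report.items()}


# Invented sample of (timestamp, source IP) pairs from an access log.
hits = [
    ("2024-01-01T00:00:05Z", "203.0.113.7"),
    ("2024-01-01T00:00:01Z", "203.0.113.9"),
    ("2024-01-01T00:00:03Z", "198.51.100.20"),
    ("2024-01-01T00:00:04Z", "192.0.2.1"),
]
report = group_by_owner(hits)
for owner, rows in report.items():
    print(owner, rows)
```

Each per-owner list is what would go into a single abuse notification. A linear scan over the netblocks is fine for a handful of providers but would want a smarter lookup structure at real traffic volumes.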

Any interest in offering PG as a multi-lingual web e-reader in any language?

I've since discontinued hosting it, but happy to add you all and merge into an official PG offering: https://www.reddit.com/r/SideProject/s/VtYKxjrMme

This was very touching, thanks for sharing. Sorry for your loss.

this is so great to hear! Distributed Proofreaders (the org that actually does transcriptions) is still looking for volunteers should you feel the urge/inclination :) https://www.pgdp.net

In what way? And from what sources? (Wikipedia as a tertiary source is supposed to be a summary of information present in reliable secondary sources — see for instance https://en.wikipedia.org/wiki/Wikipedia:Based_upon. So if the information on the Wikipedia article is incomplete or out of date, where is the correct information available?)

Seems like a case for HTTP 451 (Unavailable for Legal Reasons) rather than 404.
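As a toy illustration of the distinction, here is a Python sketch of the decision whoever operates the blocking layer would make. The function `status_for` and its two flags are hypothetical. Only the status constants come from the standard library's `http.HTTPStatus`.

```python
from http import HTTPStatus


def status_for(path_exists: bool, legally_blocked: bool) -> HTTPStatus:
    """Pick a response status; a legal block takes priority over existence."""
    if legally_blocked:
        return HTTPStatus.UNAVAILABLE_FOR_LEGAL_REASONS  # 451
    return HTTPStatus.OK if path_exists else HTTPStatus.NOT_FOUND


status = status_for(path_exists=True, legally_blocked=True)
print(status.value, status.phrase)
```

The point of 451 over 404 is that the client learns the page exists but is being withheld, rather than being told it was never there.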

It looks like the issue was that, in Italy, copyright expires 70 years after the death of the author or the first translator of a work.

To be precise, the vast majority of SE is from Gutenberg, but we also source from Faded Page, Gutenberg Australia, Wikisource and occasionally do our own transcriptions.

HTML editions from the two sites contrast interestingly:

https://www.gutenberg.org/cache/epub/1513/pg1513-images.html

https://standardebooks.org/ebooks/william-shakespeare/romeo-...

Each has its particular advantages relative to the other ...

standardebooks.org is great!
