OP here, I found this website when I was looking for a way to get the updated version (with correction) of a PG (Project Gutenberg) book and all changes/diffs from the point that I scrape the book from PG website for my language learning side project: https://www.bilinual.com
The bilinual Project also rebuild ebooks (modern HTML, PDF and open ePUB format) with better quality and readability( while it is not its prime goal). Take a look at one example here:
For example, many words don't have translations at all, and those that do are often incorrect. This feels like a very rough machine translation? For example:
> et c'est surtout dans les paroisses riveraines du Saint-Laurent
You translate this
> and Ce east primarily in the · · some saint Laurence
While Google Translate gives
> and it is especially in the parishes bordering the St.Lawrence
If you're using machine translation, why not use a Google API that might give usable results at least? If that's not plausible, maybe you should try to get together a team of volunteers to manually translate these ebooks for language learners?
(I hope these suggestions are helpful, I'm not trying to be dismissive of your project.)
Me too! GITenberg started with a variety of goals, some of which were achieved, and others which were less so. Outstanding, considered as a prototype. Standard Ebooks has been much more successful at creating a community working to improve the quality of PG-derived ebooks. We're on the same team.
bilinualcom|5 years ago
The bilinual Project also rebuild ebooks (modern HTML, PDF and open ePUB format) with better quality and readability( while it is not its prime goal). Take a look at one example here:
https://www.bilinual.com/book/18043sven/sv/en#line=68&lpp=23 https://www.bilinual.com/download/18043sven-sv-en.pdf https://www.bilinual.com/download/18043sven-sv-en.epub
bscphil|5 years ago
* One of them 404ed: https://www.bilinual.com/download/30117fren-fr-en.pdf
* The other was full of problems: https://www.bilinual.com/download/16210fren-fr-en.pdf
For example, many words don't have translations at all, and those that do are often incorrect. This feels like a very rough machine translation? For example:
> et c'est surtout dans les paroisses riveraines du Saint-Laurent
You translate this
> and Ce east primarily in the · · some saint Laurence
While Google Translate gives
> and it is especially in the parishes bordering the St.Lawrence
If you're using machine translation, why not use a Google API that might give usable results at least? If that's not plausible, maybe you should try to get together a team of volunteers to manually translate these ebooks for language learners?
(I hope these suggestions are helpful, I'm not trying to be dismissive of your project.)
yesenadam|5 years ago
gluejar|5 years ago