Ask HN: Review DocumentCloud.org
We just opened up the public version of the workspace, and I'd be obliged if y'all could take a look, and share your thoughts. There has been a lot of interest in seeing a real-world example of a webapp that uses Backbone.js, Underscore.js, and Jammit -- and this is the application that all of our open-source libraries have been extracted from.
To get your hands dirty, visit documentcloud.org/public, and try a search for "filter: annotated". Then, pop open your JS console, and you can play around with the Backbone models:
Documents.first().get("title");
Documents.map(function(doc){ return doc.get("title"); });
Documents.first().notes.fetch();
Documents.first().notes.first().set({title: "Testing..."});
... don't worry, that last one didn't persist anything to the server. If you try a .save(), it'll be denied.A couple other fun things to try:
* Drag a box to select a couple documents, and choose "Analyze -> View Timeline".
* Pop open the entities tab, and click on "show pages" next to an entity to view all of their mentions in the text.
* Click on a document's page count...
I'd love to hear your thoughts, from both technical and design perspectives, and would be glad to answer any questions that you have about the app.
[+] [-] jamesjyu|15 years ago|reply
So, to the DocumentCloud team I say: a big THANK YOU!
[+] [-] jashkenas|15 years ago|reply
[+] [-] kin|15 years ago|reply
[+] [-] nathan82|15 years ago|reply
On the document search page:
* Double-click to open isn't intuitive for me on a website. Had a few puzzled seconds wondering how to open a document.
* Middle-click to open is broken.
* If I try to do a drag selection starting over a thumbnail, it doesn't work.
* The browser right-click context menu isn't always suppressed, and completely obscures your custom menu. Maybe float it to the left of the cursor so both are visible?
* Seems like the thumbnail/list view switcher buttons are the wrong way round.
* If I select multiple documents then click download, nothing happens. Either grey out the option, or add zip downloading.
* Selecting 'view pages' when in thumbnail view breaks the layout.
* The 'Page x of x' navigation at the bottom is non standard without being any more useful. Found it annoying having to actually type in the page I wanted to go to. Duplicating the pagination nav at the top of the page would be nice.
* Left-hand document list is well thought out, like that the scrollwheel works, but it still feels a bit awkward. Not sure why.
* 'Entities' seems too specific to describe what are essentially tags. Caused a bit of confusion. Phone numbers and 'terms' are not entities.
* The 'Log In' button is a bit out of context next to the other buttons. Maybe move it to the top right next to your logo?
* As the document links are taking me off-site, I'd like the statusbar url to still appear on hover.
* Found myself focusing the search box just so the text was dark enough to read.
Document viewer:
* The document sidebar might be better on the left. Having it on the right breaks Fitt's law for the scrollbar.
* Arrow key and spacebar scrolling doesn't work on document load. You have to click to focus the document first.
* The document page navigation might be better in the top bar, rather than the sidebar. Having it in the sidebar gave me the false expectation of sidebar content that changed related to the page I was on.
* The 'Pages' thumbnail view only seems to show up for some documents.
* The zoom level I set for the 'Pages' view also changes the zoom level for the 'Document' view. So I can't easily browse with tiny thumbnails but fullsize documents.
* Annotation UI is great, very intuitive. The restrained use of colour on the rest of the site really pays off, that yellow is hard to miss.
* With two notes parallel to each other cause their markers to overlap, example:
http://www.documentcloud.org/documents/29627-cosa-letter-on-...
* From the 'notes' view I should be able to click on the passage in the note to get back to its original document context.
[+] [-] jashkenas|15 years ago|reply
[+] [-] evilchelu|15 years ago|reply
I've been working with MVC style frameworks for quite a while on the server-side but until backbone I've never been able to get my head around how to create a gui app using MVC. It was never clear where and how to put things and before I knew it, everything would go spaghetti on me.
I've done lots of js and lots of js intensive apps and I always hated it because of all the lack of structure and mix of concerns everywhere.
For the last two months, I've been using backbone.js, underscore.js, jammit and coffeescript (which is also made by jashkenas) in a quite complex app. Because of them I was able to massively rework things based on changing requirements without ending up with dead code, or strange pieces that are somehow working but not really needed.
I can't say this enough. Backbone.js really helped put order in my code and it was so easy to pick up that I am convinced you're really hurting yourself by not using it. The documentation is amazingly well done and has real examples of how you'd use the code, instead of just being generated from arguments and other relatively unimportant stuff.
Thanks jashkenas and samuelclay! And congrats on finally launching documentcloud.org. Keep kicking ass!
[+] [-] SupremumLimit|15 years ago|reply
[+] [-] jashkenas|15 years ago|reply
* Open-source projects: http://www.documentcloud.org/opensource
* Public search: http://www.documentcloud.org/public/#search/
* Just annotated docs: http://www.documentcloud.org/public/#search/filter%3A%20anno...
* Deepwater Horizon Spill Docs: http://www.documentcloud.org/public/#search/deepwater%20hori...
[+] [-] hieronymusN|15 years ago|reply
[+] [-] grandalf|15 years ago|reply
I'm a huge fan of backbone.js and underscore.js. Fantastic work.
[+] [-] jashkenas|15 years ago|reply
[+] [-] dustineichler|15 years ago|reply
Hats off, i'm inspired.
[+] [-] habitatforus|15 years ago|reply
I don't mean "They did it don't bother". It might be that your are making a non-profit Scribd focused on social news, and you'll be better because _________.
There is a need for this. There is a general distrust of media, and people want to make their own decision -- your empowering them.
PS. I like the design.
[+] [-] jashkenas|15 years ago|reply
We're a nonprofit organization funded by the Knight Foundation to help make primary source documents accessible. Upload access to DocumentCloud is restricted to journalists -- the current list of contributors can be found here:
http://www.documentcloud.org/contributors
As to the Document Viewer, which is probably what you're talking about (the equivalent of Scribd's embedded viewer), it's funny that you mention the Times, because it's their project. Alan McLean, at the Interactive News desk, created the document viewer for nytimes.com, and they make extensive use of it there (apart from Dealbook). It's also an HTML5 viewer, not a Flash viewer.
http://documents.nytimes.com/court-battle-over-rahm-emanuels...
http://www.documentcloud.org/#search/group%3A%20nytimes
PS. The design is thanks to the very talented Folkert Gorter (superfamous.com)
[+] [-] kluikens|15 years ago|reply
I agree with nathan82 about being puzzled for a few seconds on how to open a document.
Any chance you guys have plans, or even an idea, to extend a similar app to public record custodians? If so.. I'll have to watch out. :)
[+] [-] jashkenas|15 years ago|reply
As to opening it up for public record custodians ... we don't want DocumentCloud to become a general dumping ground for every government document under the sun -- sticking to newsworthy documents has worked well so far. But at the same time, it's silly to refuse a document that the government wants to publish, and then turn around and accept that same document from a reporter the next day. I think there's a fine line to walk there, while keeping the content of the catalog relevant and searchable, and we'll have to try and find it.
For the record, our policy for contributors is currently "anyone who reports on primary source documents" ... which includes newsrooms like the Washington Post as well as individual bloggers like SCOTXBlog.com.
[+] [-] acconrad|15 years ago|reply
[+] [-] bdr|15 years ago|reply