Show HN: Reor – An AI note-taking app that runs models locally

kepano|2 years ago

This is a good reminder of why storing Obsidian notes as individual Markdown files is much more useful than stuffing those notes in a database and having Markdown as an export format. The direct manipulation of files allows multiple apps to coexist and do useful things on top of the same files.

Ringz|2 years ago

That was the reason why I gave up Joplin very quickly. The last Joplin thread, here on Hacker News, has also shown once again that some still do not understand why "But Joplin can export Markdown from the database!" is not the same as simple, flat Markdown files.

michaelmior|2 years ago

It's very possible to have multiple apps coexisting using a database. Although I'll certainly concede that it's probably a lot easier with just a bunch of Markdown files.

toddmorey|2 years ago

Yes it was one of the best product decisions y'all made. Been so useful to have direct access to the files and options on how my data is processed and backed up.

pipnonsense|2 years ago

The OP and your comment just made me cancel my Milanote subscription, export all my notes to markdown and start using Obsidian (to later experiment with this Reor).

As a side-effect, I just noticed that I prefer a long markdown file with proper headings (and an outline on the side) than Milanote's board view, which initially felt like a more free form better suited for unorganized thoughts and ideas for writing that I had (I use it for my fiction writing).

I still can have documents as a list of loose thoughts, but once I am ready to organize my ideas, I just use well written and organized headers, edit the content and now I have a really useful view of my idea.

snthpy|2 years ago

Is a filesystem not a database with a varchar unique primary key, a blob data attribute and a few more metadata fields?

pokstad|2 years ago

Files seem less useful for small bits of information. I feel the urge to fill a file with a minimum threshold. A database makes more sense for that.

xenodium|2 years ago

I got an iOS journaling app on beta. It’s offline, no sign-in, no lock-in, social, etc. Saves to plain text. Syncs to your desktop if needed.

https://xenodium.com/an-ios-journaling-app-powered-by-org-pl...

samlhuillier|2 years ago

Absolutely! Really respect the work you folks are doing.

toddmorey|2 years ago

"crucially, that AI should run as much as possible privately & locally"

Just wanted to say thank you so much for this perspective and fighting the good fight.

samlhuillier|2 years ago

Thank you!

humbleferret|2 years ago

Great job!

I played around with this on a couple of small knowledge bases using an open Hermes model I had downloaded. The “related notes” feature didn't provide much value in my experience, often the link was so weak it was nonsensical. The Q&A mode was surprisingly helpful for querying notes and providing overviews, but asking anything specific typically just resulted in less than helpful or false answers. I'm sure this could be improved with a better model etc.

As a concept, I strongly support the development of private, locally-run knowledge management tools. Ideally, these solutions should prioritise user data privacy and interoperability, allowing users to easily export and migrate their notes if a new service better fits their needs. Or better yet, be completely local, but have functionality for 'plugins' so a user can import their own models or combine plugins. A bit like how Obsidian[1] allows for user created plugins to enable similar functionality to Reor, such as the Obsidan-LLM[2] plugin.

[1] https://obsidian.md/ [2] https://github.com/zatevakhin/obsidian-local-llm

dax_|2 years ago

Yeah, this is exciting - I'd much rather have it as a plugin for Obsidian though! I have my workflow with that, all the features I need. Having some separate AI notes app isn't what I would like to use.

samlhuillier|2 years ago

Thank you for your feedback!

Working hard on improving the chunking to improve related notes section. RAG is fairly naive right now, with lots of improvements coming in the next few weeks.

ilaksh|2 years ago

Which model exactly did you use and how large? I feel like even the best 7b models are just a bit too dumb for most things that I have tried. A 70b model or Mixtral or sometimes 34b seem to be adequate for some things. But those are several times larger and don't run on my oldish hardware.

mcbetz|2 years ago

Interesting project, wishing you all the best!

If you are using Obsidian, Smart Connections in v2 (1) does also support local embeddings and shows related notes based on semantic similarity.

It's not super great on bi/multi-lingual vaults (DE + EN in my case), but it's improving rapidly and might soon support embedding models that cater for these cases as well.

(1) https://github.com/brianpetro/obsidian-smart-connections

fastball|2 years ago

Does the future of knowledge management involve using lots of AI to organize pieces of knowledge?

I think "here be dragons", and that over-relying on AI to do all your organization for you will very possibly (probably?) cause you to become worse at thinking.

No data to back this up because it is still early days in the proliferation of such tools, but historically making learning and thinking and "knowledge management" more passive does not improve outcomes.

bhpm|2 years ago

> I think "here be dragons", and that over-relying on AI to do all your organization for you will very possibly (probably?) cause you to become worse at thinking.

Socrates said exactly this.

But when they came to writing, Theuth said: “O King, here is something that, once learned, will make the Egyptians wiser and will improve their memory; I have discovered a potion for memory and for wisdom.” Thamus, however, replied: “O most expert Theuth, one man can give birth to the elements of an art, but only another can judge how they can benefit or harm those who will use them. And now, since you are the father of writing, your affection for it has made you describe its effects as the opposite of what they really are. In fact, it will introduce forgetfulness into the soul of those who learn it: they will not practice using their memory because they will put their trust in writing, which is external and depends on signs that belong to others, instead of trying to remember from the inside, completely on their own. You have not discovered a potion for remembering, but for reminding; you provide your students with the appearance of wisdom, not with its reality. Your invention will enable them to hear many things without being properly taught, and they will imagine that they have come to know much while for the most part they will know nothing. And they will be difficult to get along with, since they will merely appear to be wise instead of really being so.”

GTP|2 years ago

I don't think that the problem would be becoming worse at thinking, but I see a possible related problem. Each one of us has its own way of organizing things, that looks logical to us but not necessarily to others: think about how you organize things inside your home vs where other people put their stuff. A similar issue could arise with AI tools, that will classify and organize documents based on their logic, which doesn't necessarily align with ours.

samlhuillier|2 years ago

I agree with this.

In some cases, hard thinking and searching for things manually can really enhance understanding and build your knowledge.

In other cases, particularly when ideating for example, you want to be given "inspiration" from other related ideas to build upon other ideas you've had previously.

I think it's a mix of both - reaching for AI as and when when you need it - but avoiding it intentionally at times as well.

hgomersall|2 years ago

Honest discussion point: do you think organisational stuff is important thinking? IME it's precisely this sort of stuff that distracts me from thinking about hard stuff - the urgent displacing the important.

ParetoOptimal|2 years ago

I think you want to organize your own knowledge graph and then use the LLM to find novel connections or answer questions based upon it.

CrypticShift|2 years ago

Some suggestions :

- Create multiple independent "vaults" (like obsidian).

- Append links to related notes, so you can use (Obsidian's) graph view to map the AI connections.

- "Minimize" the UI to just the chat window.

- Read other formats (mainly pdfs).

- Integrate with browser history/bookmarks (maybe just a script to manually import them as markdown ?)

Thanks for Reor !

samlhuillier|2 years ago

Thanks for your feedback!

- Multiple vaults is in fact in a PR right now: https://github.com/reorproject/reor/pull/28

- Manual linking is coming.

- Minimizing the UI to chat is interesting. Right now I guess you can drag chat to cover anything - but yes perhaps a toggle between two modes could be interesting.

- Read other formats also something in the pipeline. Just need to sort out the editor itself to support something like this. Perhaps pdfs would just be embedded into the vector db but not accessible to the editor.

- Integrating with browser history and bookmarks is a big feature. Things like web clipping and bringing in context from different places are interesting...

zarathustreal|2 years ago

The problem with PDFs is that text isn’t necessarily text. Most RAG implementations that support them don’t do any sort of OCR or use local offline OCR implementations that have really low accuracy.

haswell|2 years ago

Literally yesterday I spun up a project with the intent to build something exactly like this for Obsidian.

Excited to see something already far more realized, and I’m looking forward to trying this out.

I’ve been working on a larger than small writing project using Obsidian, and my ultimate goal is to have conversations with the corpus of what I’ve written, and to use this to hone ideas and experiment with new ways of exploring the content.

Not sure if local LLMs are powerful enough yet to enable meaningful/reliable outcomes, but this is the kind of stuff that really excites me about the future of this tech.

sofixa|2 years ago

There are these plugins:

https://github.com/zatevakhin/obsidian-local-llm

https://github.com/hinterdupfinger/obsidian-ollama

Which already exist and if nothing else are decent starting points.

> Not sure if local LLMs are powerful enough yet to enable meaningful/reliable outcomes

I've dabbled, briefly, with Ollama running Mistral locally on an M1 MacBook Pro with 32GB of unified memory, and throwing a couple of hundred markdown documents at it via a RAG resulted in quite decent output to prompts asking questions about abstract contents/summariesbbased on those docs.

So I'd say we're already at a point where you can have meaningful outcomes; reliability is a whole other issue though.

wbogusz|2 years ago

Great to see something like this actualized. I’m a huge fan of Obsidian and its graph based connections for note taking.

Always see parallels drawn between Obsidian note structures and whole “2nd brain” idea for personal knowledge management, had seemed like a natural next step would be to implement note retrieval for intelligent references. Will have to check this out

davidy123|2 years ago

Super interesting project. I like its focus. Wondering if the author looked into Cozodb, or other databases that combine vector + graph/triples. Since probably neuro-symbolic is the best path. https://docs.cozodb.org/en/latest/releases/v0.6.html talks about this idea.

samlhuillier|2 years ago

Interesting. Thanks for sharing will take a look!

jddj|2 years ago

Extremely interesting read, thanks for sharing.

nerdjon|2 years ago

I have been looking for a while for a better way to take notes, what I was using worked fine but it did tend to end up being a blackhole.

I just downloaded this, I realize that it is still a new tool. But I think a critical feature needs to be context. The ability to have completely separate contexts of notes, maybe even completely different databases.

That way similar sounding to an LLM but contextually different don't get brought up. I figured that is what "new directory" did but it does not appear that way.

So is there any plans to implement a switcher for database? I can't find a way to change where it is right now.

But doing some quick tests importing some notes in it does seem very promising and I really like where you are taking it. It is just confusing notes that should be in distinct contexts.

Edit: I see this is already in PR! Awesome.

outside1234|2 years ago

Which local model works best for folks? Sort of intimidated by the large number of models on Hugging Face and it is hard to conceptualize which of the variants work the best.

I downloaded:

mistral-7b-v0.1.Q4_K_M.gguf Q4_K_M 4bits 4.37 GB 6.87 GB medium, balanced quality - recommended

Was that a good choice?

stuartriffle|2 years ago

Yes (imho) just be sure to get the instruct or chat model of any LLM you try. There is an awesome snapshot of what models people are using here:

https://openrouter.ai/rankings

elcombato|2 years ago

I really like this idea and the app, but beware when using your existing logseq folder it will mess up the structure/indentation/bullet-points of the notes.

syncbehind|2 years ago

Really wish I read this before I gave this a go

nerdyadventurer|2 years ago

Do not you use git with LogSeq?

bemusedthrow75|2 years ago

I think I struggle to see any application of LLMs for my notes that wouldn't, in practice, be just as easily implemented as a search facility.

My main challenge with my notes (that I've been collecting for about 15 years) is remembering to consult them before I google.

I suppose a unified interface to both my notes via LLM and internet search would help, but then I get that with my Apple Notes and the Mac's systemwide search, if I remember to use it.

traverseda|2 years ago

It's not the application of LLMs for your notes, it's the application of your notes for an LLM. Like if you're running a custom code-generation LLM, it could refer back to parts of your notes using retrieval aided generation to get some more context on the work you're having it do.

But yes, a good application is probably a ways away. Still, LLM vector embedding make a good search engine pretty easy to implement, especially if you're working with small sets of well curated data where exact keyword matching might not work great.

Like if you search for "happy" you could get your happiest journal entries, even if none of them explicitly mention the word happy.

dev_tty01|2 years ago

Can I still just run grep on my notes? Not trying to be snide, just wondering if the raw text remains available for simple text operations.

rcarmo|2 years ago

I did my usual test for these things - I tossed in the Markdown source for my site, which has 20 years of notes (https://taoofmac.com/static/graph).

Surprisingly, indexing sort of worked. But since I have an index.md per folder (so that media is grouped with text for every note) the editor is confused, and clicking on links always took me to a blank screen.

Also, pretty much every question gives an error message that says "Error: The default context shift strategy did not return a history that fits the context size", likely because there is too much context...

Edit: Fixed most of it by using a mistral instruct model. But the editor does not know what front matter is (neither in editing nor in previews, where front matter looks like huge heading blocks)

rcarmo|2 years ago

Also, it destroyed front matter on a few files I clicked on, which is a big no-no. Filed that as an issue.

dvorka|2 years ago

Rear is a really interesting project with admirable goals. I believe this is just the beginning, but you have already done a great job!

I have been working on my note-taking application (https://github.com/dvorka/mindforger) for some time and wanted to go in the same direction. However, I gave up (for now). I used ggerganov/llama.cpp to host LLM models locally on a CPU-only machine with 32GB RAM, and used them for both RAG and note-taking use cases (like https://www.mindforger.com/index-200.html#llm). However, it did not work well for me - the performance was poor (high hardware utilization, long response times, failures, and crashes) and the actual responses were rarely useful (off-topic and impractical responses, hallucinations). I tried llama-2 7B with 4b quantization and a couple of similar models. Although I'm not happy about it, I switched to an online commercial LLM because it performs really well in terms of response quality, speed, and affordability. I frequently use the integrated LLM in my note-taking app as it can be used for many things.

Anyway, Reor "only" uses the locally hosted LLM in the generation phase of the RAG, which is a nicely constraint use case. I believe that a really lightweight LLM - I'm thinking about a tiny base model fine-tuned for summarization - could be the way to go (fast, non-hallucinating). I'm really curious to know if you have any suggestions or if you will have any in the future!

As for the vector DB, considering the resource-related problems I mentioned earlier, I was thinking about something similar to facebookresearch/faiss, which, unlike LanceDB, is not a fully-fledged vector DB. Have you made any experiments with similarity search projects or vector DBs? I would be interested in the trade-offs similar to small/large/hosted LLMs.

Overall, I think that both RAG with my personal notes as a corpus and a locally hosted generic purpose LLM for the use cases I mentioned above can take personal note-taking apps to a new level. This is the way! ;)

Good luck with your project!

SamBam|2 years ago

So if I point this at my existing Obsidian library, what happens? Does this add to existing files, or add new files, to store the output of things generated by the AI? Doe the chunking of the files only happen within the vector database? What if I later edit my files in Obsidian and only open up Reor after -- does the full chucking happen every time, or can it notice that only a few new files exist?

Just wondering what the interaction might be for someone who uses Obsidian but might turn to this occasionally.

samlhuillier|2 years ago

It's filesystem mapping 1:1. Basically the same thing Obsidian does when you open a vault. You can create new files with Reor, create directories and edit existing files. Chunking happens only in vector DB and everything is synced automatically so you shouldn't notice anything if you reopen Reor after using Obsidian.

In short, yes it'd work seamlessly if you wanted to use it occasionally.

kingkongjaffa|2 years ago

I tried it and it doesn't really work, the models have no knowledge of my notes.

I tried llama-2-7b-chat.Q4_K_M.gguf and phi-2.Q4_K_M.gguf and neither showed any knowledge of the notes I added to the folder.

Does anyone know of a good way to test if it's working (a prompt?) and does anyone know of other projects like this?

dkarras|2 years ago

I had been researching stuff related to this for some time. Interesting project! Why not an obsidian plugin to tap into the ecosystem?

samlhuillier|2 years ago

Two reasons:

1. The libraries I used to run models locally didn't work inside a plugin.

2. I believe AI is a fairly big paradigm shift that requires new software.

cfcfcf|2 years ago

Seconded. I like this idea but wouldn't want to trade the Obsidian UI. Would love to see something like this as a plugin.

yosito|2 years ago

Seems promising, but I couldn't get it to work at all. Seems like I'm not the only one having issues: https://github.com/reorproject/reor/issues

unknown|2 years ago

[deleted]

2024throwaway|2 years ago

Running with a Local LLM on a Mac M1, this completely locked up my system for minutes. I tried to let it run, because the progress bar was ticking every now and then, but after 10 minutes I gave up and killed it.

raxxorraxor|2 years ago

You are running the wrong model. Get a smaller one Macs are usually ok with their performance since the GPU has access to main memory. A decent Nvidia GPU with lots of memory is still king though.

I run codelama:7b on a MacBook air, and even auto complete is partially usable.

tarruda|2 years ago

I wouldn't recommend unless you got at least 16gb ram (though possibly more is needed depending on what model is used).

erickf1|2 years ago

I like the idea. Unfortunately, could not get it to work on Linux. Making a note caused a crash. Searching notes crashed. LLM chat would cause crash. Hope to see it work some time.

gavmor|2 years ago

Seems cool, but didn't utilize my GPU? At any rate, definitely a futuristic POC, and prototype for the way I see desktop software going in the next few years.

samlhuillier|2 years ago

Yes unfortunately not implemented yet. Will be coming soon though :)

calebdre|2 years ago

This is really cool! Something i've actually been thinking about for a while.

Would you mind a pull request that spruces up the design a bit?

samlhuillier|2 years ago

Absolutely! Would love your help.

mrtesthah|2 years ago

It doesn't seem to view my plain text notes. What file formats are currently supported, if plain text is not?

samlhuillier|2 years ago

Just markdown right now, plain text is coming

poulpy123|2 years ago

since LLM are computer intensive, you should add some hardware requirements in the README

nextaccountic|2 years ago

Is this really fully open source? What is the catch / what is the proprietary part?

samlhuillier|2 years ago

No catch pal

sigmonsays|2 years ago

serious question: when do you ever have your own notes and can't find the answer ?

i would call it bad note taking to not be able to recall an answer you put into your notes.

I like the idea of something like this but i've struggled to find a real use case.

PeterisP|2 years ago

Oh I definitely have cases where I'm quite sure that I have had something in my notes and it takes quite an effort to find them.

It is probably relevant to note that I have more than a decade of notes, and more than half of them are not written by me but are excerpts from various relevant resources, scientific papers, etc; so there are cases where, for example, I know that many years ago I had seen an interesting application of method X and should have in my notes, but I don't necessarily remember the exact keywords how it was discussed back then.

infectoid|2 years ago

You underestimate the number of bad notetakers.

I’m one of them.

This tool might actually make me take better notes.

Maybe.

donclark|2 years ago

Could I share an idea(note) with a friend? And we grow the idea together?

samlhuillier|2 years ago

Not yet!

lenerdenator|2 years ago

How would this run on, say, a M2 Pro MBP with 32GB RAM?

alsetmusic|2 years ago

That should be more than enough. I've been running Ollama on an M1 Max with 64GB of RAM without issue.

frankcort|2 years ago

Wow cool, can I import my One Note notebooks?!!??

clscott|2 years ago

You can use Obsidian to create markdown from One Note.

https://help.obsidian.md/import/onenote

samlhuillier|2 years ago

If you can convert your One Note notes to markdown files then yes. On startup, you'll be asked to choose your vault directory - which needs to be a directory full of markdown files.

donclark|2 years ago

How would I use this as a mobile user?

PeterisP|2 years ago

Wait some years until mobile hardware is sufficiently powerful to run similar models locally.

Or don't do this at all, and rely on cloud models like most other solutions - running things locally has some benefits for privacy, control and perhaps cost, but you can get all of the same functionality without a local model.

dkwsupermanga|2 years ago

[deleted]

Donaldzibe|2 years ago

[deleted]

102 comments