Show HN: Dayflow – A git log for your day
480 points | jerryliu12 | 5 months ago | github.com
Here's what it does:
- It creates a semantic timeline of your day;
- It does it by understanding the content on your screen (with local or cloud VLMs);
- This allows you to see exactly where your time went without any manual logging.
Traditional time trackers tell you "3 hours in Chrome" which is not very helpful. Dayflow actually understands if you're reading documentation, debugging code, or scrolling HN. Instead of "Chrome: 3 hours", you get "Reviewed PR comments: 45min", "Read HN thread about Rust: 20min", "Debugged auth flow: 1.5hr".
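To make the "semantic timeline" idea concrete, here is a minimal sketch of how short labeled observations could be merged into entries like the ones above. It is not taken from Dayflow's source; the `Observation` type, its fields, and the merge rule are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Observation:
    # Hypothetical record: one labeled slice of the day, in minutes since start.
    start_min: int
    end_min: int
    label: str  # assumed to come from the vision model


def merge_observations(observations):
    """Collapse back-to-back observations with the same label into one entry."""
    timeline = []
    for obs in sorted(observations, key=lambda o: o.start_min):
        last = timeline[-1] if timeline else None
        if last and last.label == obs.label and last.end_min == obs.start_min:
            last.end_min = obs.end_min  # extend the running entry
        else:
            # Copy so the caller's objects are never mutated.
            timeline.append(Observation(obs.start_min, obs.end_min, obs.label))
    return timeline


def format_entry(entry):
    """Render an entry in the 'label: duration' style used in the post."""
    return f"{entry.label}: {entry.end_min - entry.start_min}min"


observations = [
    Observation(0, 15, "Reviewed PR comments"),
    Observation(15, 30, "Reviewed PR comments"),
    Observation(30, 45, "Reviewed PR comments"),
    Observation(45, 65, "Read HN thread about Rust"),
]
entries = [format_entry(e) for e in merge_observations(observations)]
```

With the sample data, `entries` holds one 45-minute entry for the PR review and one 20-minute entry for the HN thread.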
I was an early Rewind user but rarely used the retrieval feature. I built Dayflow because I saw other interesting uses for screen data. I find that it helps me stay on track while working - I check it every few hours and make sure I’m spending my time the way I intended - if I’m not, I try to course correct.
Here’s what you need to know about privacy:
- Run 100% locally using qwen2.5-vl-3b (~4GB model)
- No cloud uploads, no account
- Full source available under MIT license (https://github.com/JerryZLiu/Dayflow)
- Optional: BYO Gemini API key for better quality (stored in Keychain, with free-tier workaround to prevent training on your data)
The tech stack is pretty simple: SwiftUI with a local SQLite DB. It uses native macOS APIs for efficient screen captures. Since most people who run LLMs locally already have their tool of choice (Ollama, LM Studio, etc.), I decided not to embed an LLM into Dayflow.
By far the biggest challenge was adapting from SOTA vision models like Gemini 2.5 Pro to small, local models. My constraints were that it had to take up <4GB of RAM and have vision capabilities. I had to do a lot of evals to figure out that Qwen2.5VL-3B was the best balance of size and quality, but there was still a sizable tradeoff in quality that I had to accept. I also got creative with sampling rates and prompt chunking to deal with the 100x smaller context window. Processing a 15-minute segment takes ~32 local LLM calls vs 2 Gemini calls!
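A rough sketch of the general idea behind lowering the sampling rate and chunking frames across several calls. The function names and every number here (capture interval, `every_n`, `frames_per_call`) are made up for illustration; they are not Dayflow's actual settings and are not meant to reproduce the ~32-call figure.

```python
def sample_frames(frame_times, every_n):
    """Keep every n-th frame to lower the effective sampling rate."""
    return frame_times[::every_n]


def chunk(items, size):
    """Split items into consecutive chunks of at most `size` elements."""
    return [items[i:i + size] for i in range(0, len(items), size)]


def plan_calls(segment_seconds, capture_interval_s, every_n, frames_per_call):
    """Return one list of frame timestamps per model call (hypothetical planner)."""
    frame_times = list(range(0, segment_seconds, capture_interval_s))
    sampled = sample_frames(frame_times, every_n)
    return chunk(sampled, frames_per_call)


# Assumed values: a 15-minute segment captured once per second,
# keeping every 15th frame and sending 2 frames per call.
calls = plan_calls(segment_seconds=900, capture_interval_s=1, every_n=15, frames_per_call=2)
```

With these assumed values the segment is covered by 30 small calls. A model with a much larger context window could take far more frames per call, which is why the call counts differ so much between local and cloud models.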
Here’s what I’m working on next:
- Distillation: Using Gemini's high-quality outputs as training data to teach a local model the patterns it needs, hopefully closing the quality gap.
- Custom dashboards where you can track answers to any question like "How long did I spend on HN?" or "Hours until my first deep work session of the day".
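Since the timeline lives in a local SQLite DB, a dashboard question like "How long did I spend on HN?" could in principle be a plain SQL query. The sketch below assumes a hypothetical `timeline` table with `label`, `start_ts`, and `end_ts` columns (timestamps in seconds); Dayflow's real schema may look quite different.

```python
import sqlite3

# In-memory database with a hypothetical schema, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE timeline (
        label    TEXT    NOT NULL,
        start_ts INTEGER NOT NULL,  -- seconds since start of day
        end_ts   INTEGER NOT NULL
    )
    """
)
conn.executemany(
    "INSERT INTO timeline VALUES (?, ?, ?)",
    [
        ("Read HN thread about Rust", 0, 1200),
        ("Debugged auth flow", 1200, 6600),
        ("Scrolled HN front page", 6600, 7200),
    ],
)


def minutes_matching(conn, pattern):
    """Total whole minutes spent on entries whose label matches a LIKE pattern."""
    row = conn.execute(
        "SELECT COALESCE(SUM(end_ts - start_ts), 0) / 60 "
        "FROM timeline WHERE label LIKE ?",
        (pattern,),
    ).fetchone()
    return row[0]


hn_minutes = minutes_matching(conn, "%HN%")
```

Matching on label text is the simplest possible approach. A real dashboard would more likely need categories or model-assisted matching, since labels are free-form.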
I'd love to hear your thoughts, especially if you've struggled with productivity tracking or have ideas for what you'd want from a tool like this.
andrewmutz|5 months ago
1zael|5 months ago
whalesalad|5 months ago
MollyRealized|5 months ago
She had been complaining the day before about having to reconstruct a huge bunch of little 0.1 entries involving e-mails to various individuals in cases. If it could be done automatically, through a local LLM? chef's kiss
Trust me, law is definitely where you want to land this thing.
In all honesty, I have absolutely no negotiating power or decision-making authority for my firm, but it's a big one -- if that's a direction you want to go, I can't guarantee I can swing enough weight, but I probably could find you the right people to talk to and give you an introduction.
mellosouls|5 months ago
laurieg|5 months ago
Some things I would like to be able to do with software like this:
- Identify the 'spark' of a distraction. For example, opening my email inbox to read a specific email also shows me many unrelated emails. These can easily be the cause of a 5-15 minute distraction. This information is often actionable. I installed browser plugins to hide my youtube suggested videos and my distractions went down. I made sure to close all unused windows to avoid catching a glimpse of unrelated work.
- Identify repeated tasks, and the cadence of those tasks. Do I manually make an invoice once a week for a particular edge case? Is the process basically identical every time? Could this be automated?
- How was I feeling before, during and after a task. (This is a very broad and intentionally not well-defined question, but I think it has the most promise for improving procrastination and task initiation).
jerryliu12|5 months ago
rw2|5 months ago
astafrig|5 months ago
yewenjie|5 months ago
jerryliu12|5 months ago
I'm reading through papers that suggest it should be possible to get SOTA performance on local models via distillation, and that's what I'll experiment with next.
muzani|5 months ago
CIPHERSTONE|5 months ago
nemo1618|5 months ago
LocalPCGuy|5 months ago
Going a step further, "real time" (given processing delay) to help stay on task when the focus has shifted to something unrelated (maybe allow the individual to define this or say yes/no to train the prompts as it goes).
Anyways, it looks great. I also liked the _idea_ of Windows Recall, so to see something like this that can be privacy first is really nice.
olex|5 months ago
Edit: Nvm, it seems it always records the display that is currently in focus. That is probably the better way to handle it, since it automatically solves the "ignore what's shown but not interacted with on secondary displays" problem.
LocalPCGuy|5 months ago
jerryliu12|5 months ago
thalesac|5 months ago
jappwilson|5 months ago
github.com/mediar-ai/screenpipe
louis030195|5 months ago
kind of sad it's macos only, i'm mostly windows user now :)
r0bbie|5 months ago
jerryliu12|5 months ago
zeroq|5 months ago
This could help battle procrastination, organize your time in the long run, bill your clients more efficiently, etc. 20 years younger, hyper productive me would kill for such a product.
But then I recall when I accidentally suggested TimeRescue to my boss at one time, and suddenly he was skimming through everyone's daily logs to see if they were spending 100% of their time in business-facing apps.
When I first heard about "covid mouse mover devices" that faked activity for remote workers I thought it was a joke. Seriously.
But I'm afraid this is the dystopian future. Employers constantly looking at your screen and getting spreadsheets with your daily effort.
Overall, very disturbing product.
defgeneric|5 months ago
jerryliu12|5 months ago
tmychow|5 months ago
Feel like something of this shape should have existed for a while, but this is very well executed!
requilence|5 months ago
jerryliu12|5 months ago
ahoog42|5 months ago
hx8|5 months ago
tolerance|5 months ago
https://brettterpstra.com/projects/doing/
pastapliiats|5 months ago
7bit|5 months ago
boomlify|5 months ago
tiernano|5 months ago
jerryliu12|5 months ago
lucfranken|5 months ago
Compliments for the Wizard - that one works perfectly, at least with Gemini. One little detail: you have a GitHub Star button in it, which was in an illogical place and made me stop and think.
jauntywundrkind|5 months ago
Replace PostgreSQL with Git for your next project for git data storing. https://news.ycombinator.com/item?id=4535144 https://devcenter.upsun.com/posts/why-you-should-replace-pos...
Consumed.today day-logging single user microsite. https://consumed.today/ https://news.ycombinator.com/item?id=45351446
Cute serendipity, rule of three. Neat project too; conceptually it sounds like an amazing ability to be able to better watch ourselves. Doing it via screenshots & AI feels like a fun sense-making adventure that actually makes a lot of sense, that can maybe try to pick through & discern what the screen is doing in a lot of different scenarios.
mrklol|5 months ago
If it’s recording 15 seconds, how often are you doing that? Once every 15m as the analysis interval is 15m?
pi-err|5 months ago
So I'm not sure I buy the lightweight/low-impact claim.
novoreorx|5 months ago
[1]: https://screenmemory.app/
ghm2199|5 months ago
1. "Create a reminder for reading this email at 5:00 pm" and this could infer what to do from the screenshot's description (plus a local MCP tool for the calendar).
2. "Can you fetch that file from that project in that workspace and implement the pattern in the code on my vscode terminal?" It can lower the cognitive fatigue of typing and clicking in a bunch of places.
3. Take notes as I describe something on the screen. It could be for prompt composition e.g. get the link from my browser and the file on vscode and write code that does XYZ.
anyg|5 months ago
Another approach is to run OCR on 1FPS screenshots. Everything runs locally without draining the battery like an LLM would.
jerryliu12|5 months ago
rcarmo|5 months ago
Maybe patching https://github.com/JerryZLiu/Dayflow/blob/main/Dayflow/Dayfl... to say "Describe what you see on this computer screen in the style of Werner Herzog" would do it...
lucfranken|5 months ago
As already seen in the comments, there is a lot of interest in adding more data beyond just screen input.
Could be things like:
- Apple HealthKit / watch
- custom apps
- Phone logs
You also stated, rightly, that improving your core feature needs a lot of focus.
It might be interesting to allow some kind of API / plugin area. So that people can expand on your core feature and add the desired parts. Might in the future expand to some kind of AppStore like feature with plugins.
That would keep your work focused and allow others to complete it according to their own vision, for themselves and for others.
p_zuckerman|5 months ago
ttoinou|5 months ago
philipallstar|5 months ago
Right?
mustaphah|5 months ago
Funny enough, I had a similar idea a few weeks back; I jotted it down in my idea sketchpad. It felt a bit ambitious for an open-source side project, and I wasn't sure if it could even work with a local LLM. I was genuinely excited about it, nonetheless.
Now that I know it's totally viable, I've got even more reasons to build a Linux version myself.
Klaster_1|5 months ago
christoph123|5 months ago
aiven|5 months ago
rokob|5 months ago
dpflan|5 months ago
sipjca|5 months ago
atoav|5 months ago
xp84|5 months ago
jerryliu12|5 months ago
graeme|5 months ago
jerryliu12|5 months ago
fsto|5 months ago
pgcosta|5 months ago
sawyna|5 months ago
mellosouls|5 months ago
Kudos particularly for the efforts you've gone to on explaining privacy implications.
jerryliu12|5 months ago
danielfalbo|5 months ago
netnameus|5 months ago
voidUpdate|5 months ago
blef|5 months ago
user3939382|5 months ago
dmd|5 months ago
danielfalbo|5 months ago
smcleod|5 months ago
jerryliu12|5 months ago
chewhongjun96|5 months ago
i.e. apple watch for sleep, running, activity levels? it could really give a 360 view of your life
jerryliu12|5 months ago
wayeq|5 months ago
rememberlenny|5 months ago
muggermuch|5 months ago
zeeqeen|5 months ago
ttoinou|5 months ago
akhilnchauhan|5 months ago
VadimPR|5 months ago
tonyhart7|5 months ago
ctrlp|5 months ago
rasulkireev|5 months ago
j1000|5 months ago
scuff3d|5 months ago
graeme|5 months ago
syngrog66|5 months ago
just wow
2025 is getting surreal online
matthewparal|5 months ago
[deleted]