sammyd56 | 5 months ago | on: Andrej Karpathy – It will take a decade to work through the issues with agents
sammyd56's comments
sammyd56 | 5 months ago | on: NanoChat – The best ChatGPT that $100 can buy
sammyd56 | 5 months ago | on: NanoChat – The best ChatGPT that $100 can buy
sammyd56 | 5 months ago | on: NanoChat – The best ChatGPT that $100 can buy
I didn't get as good results as Karpathy (unlucky seed?)
It's fun to play with though...
User: How many legs does a dog have? Assistant: That's a great question that has been debated by dog enthusiasts for centuries. There's no one "right" answer (...)
sammyd56 | 5 months ago | on: NanoChat – The best ChatGPT that $100 can buy
sammyd56 | 5 months ago | on: NanoChat – The best ChatGPT that $100 can buy
Will share the resulting model once ready (4 hours from now) for anyone to test inference.
sammyd56 | 3 years ago | on: GPT3 Get answers to technical questions from your documentation site
sammyd56 | 3 years ago | on: Poetry meets journalism, with LLMs and diffusion models
This one is mine. It's a light-hearted digital newspaper of sorts, covering news from local British communities through the medium of verse (generated by LLMs).
Until now I've been using ChatGPT for the generation, with a fairly generic prompt that asks for a poem about the article that follows. ChatGPT's ability to summarise is incredible. It's really not great, though, at rhyme and meter. That means a decent amount of curation and heavy editing is needed for the best to get something passable. Prompt engineering has not seemed to have a meaningful impact. I'm looking to fine-tune a davinci model, which I think will deliver higher quality with less effort.
Some example from the current process:
Poem: https://rhymingreporter.art/farewell-little-red/ | Original article: https://www.cornwalllive.com/whats-on/food-drink/little-red-...
Poem: https://rhymingreporter.art/flowing-frocks-icy-blue/ | Original article: https://www.cornwalllive.com/whats-on/whats-on-news/gallery/...
The quality can mostly be blamed on me, rather than GPT-3. I haven't written a poem since school :)
The accompanying illustrations are created with Stable Diffusion using DiffusionBee. Images take around 30s to generate on my Macbook Air M1. I'm looking to switch to MochiDiffusion to cut generation time a bit.
The blog is running Ghost on a small DigitalOcean VPS, with emails delivered by Mailgun.
The process right now is somewhat labour-intensive: between researching news stories, iterating on the content, and publishing, it takes a decent amount of time for each piece of content. I'm confident in being able to automate a large part of it, in time.
One fun fact I learned when planning the virtual road-trip for this project: in average traffic conditions, it's possible to visit every city in England in less than 48 hours. The near-optimum solution to this formulation of the Traveling Salesmen Problem (starting in the South West), a route taking 47:00:10, was calculated in less than 5 seconds with a Guided Local Search algorithm. [1]
Technology means that I can virtually, learn about, write creatively, and publish regularly, all whilst having a family and a full-time job. What a time to be alive!
Very open to your thoughts, and indeed to feedback on the concept or the execution.
sammyd56 | 3 years ago | on: Ask HN: Upskilling as a Data Engineer
For short-term career growth, $YOUR_COMPANY's current preferred ETL tool will have the biggest ROI. Focus on design patterns: while APIs will come and go, the concepts, as you rightly say, are transferrable.
If you're looking to land a new role: the market says dbt, databricks and snowflake are pretty strong bets.
If it's personal interest, or a high-risk, high-reward long term play, take your pick from any of the new hotness!
sammyd56 | 5 years ago | on: Launch HN: Kanda (YC W21) – Let tradespeople offer finance to their customers
sammyd56 | 5 years ago | on: Launch HN: Synth (YC S20) – Realistic, synthetic test data for your app
* Creating models from a file borks on anything non-UTF8 (i.e. most legacy system outputs)
* `synth model inspect` output does not match the docs - how do I see the JSON?
sammyd56 | 6 years ago | on: 380k Guesses Dataset – Higher or Lower?
sammyd56 | 6 years ago | on: 380k Guesses Dataset – Higher or Lower?
sammyd56 | 6 years ago | on: 380k Guesses
edit: Updated licence to CC-BY-SA (i.e. do what you want as long as you credit)
edit2: Don't seem to be able to re-post :(
sammyd56 | 6 years ago | on: Ask HN: Who wants to be hired? (November 2019)
Location: London, UK
Remote: Yes
Willing to relocate: No
Technologies: Python, SQL, Javascript, AWS, Linux
Résumé/CV: on request
Email: sjd followed by three threes (gmail)
sammyd56 | 8 years ago | on: Show HN: A free, lightweight static page to get stock quotes using the IEX API
https://samdobson.github.io/stocks/?Banks=GS,MS,JPM&Tech=AAP...
sammyd56 | 11 years ago | on: Share: The Icon No One Agrees On
sammyd56 | 11 years ago | on: Why Game Developers Keep Getting Laid Off
sammyd56 | 13 years ago | on: Adria Richards, PyCon, and How We All Lost
http://butyoureagirl.com/13871/success-against-the-odds-fill...
(site is currently down, may need to view cache)
I'm surprised nobody has picked up on this yet.
sammyd56 | 13 years ago | on: Why to Make Your App Free for Education
As a teacher and member of that generation, thankyou.
Are there any other awesome product like lucidchart available free of charge to K-12 schools?