Show HN: Generative Fill with AI and 3D
360 points| olokobayusuf | 2 years ago |github.com | reply
You've probably seen projects that add objects to an image from a style or text prompt, like InteriorAI (levelsio) and Adobe Firefly. The prevalent issue with these diffusion-based inpainting approaches is that they don't yet have great conditioning on lighting, perspective, and structure. You'll often get incorrect or generic shadows; warped-looking objects; and distorted backgrounds.
What is Fill 3D? Fill 3D is an exploration on doing generative fill in 3D to render ultra-realistic results that harmonize with the background image, using industry-standard path tracing, akin to compositing in Hollywood movies.
How does it work? 1. Deproject: First, deproject an image to a 3D shell using both geometric and photometric cues from the input image. 2. Place: Draw rectangles and describe what you want in them, akin to Photoshop's Generative Fill feature. 3. Render: Use good ol' path tracing to render ultra-realistic results.
Why Fill 3D? + The results are insanely realistic (see video in the github repo, or on the website). + Fast enough: Currently, generations take 40-80 seconds. Diffusion takes ~10seconds, so we're slower, but for the level of realism, it's pretty good. + Potential applications: I'm thinking of virtual staging in real estate media, what do you think?
Check it out at https://fill3d.ai + There's API access! :D + Right now, you need an image of an empty room. Will loosen this restriction over time.
Fill 3D is built on Function (https://fxn.ai). With Function, I can run the Python functions that do the steps above on powerful GPUs with only code (no Dockerfile, YAML, k8s, etc), and invoke them from just about anywhere. I'm the founder of fxn.
Tell me what you think!!
PS: This is my first Show HN, so please be nice :)
[+] [-] LeonM|2 years ago|reply
Where I live, it is already common practice for real estate 'agents' to photoshop the properties listed for sale to make them look fully renovated and furnished. When in reality the house is empty and in very bad shape.
This tech will make it even harder to judge a property without actually viewing it in real life.
I think we can no longer stop tech like this from being used in ads (because that's effectively what property listings are nowadays). The only solution I think is policies/laws that prevent real-estate marketplaces from showing fake pictures.
That all said, I think the author can make big money from realtors by selling this tech as a subscription model.
[+] [-] linsomniac|2 years ago|reply
I'm sure it varies area to area, but the biggest thing I see in our area is things like adding sunsets in the windows or behind the property photos, but we wouldn't necessarily know if a Realtor had photoshopped out mold or water damage or the like.
[+] [-] smrtinsert|2 years ago|reply
[+] [-] matsemann|2 years ago|reply
Also in that regards: I'm curious about what it can't handle. Any situations where it borks?
[+] [-] olokobayusuf|2 years ago|reply
[+] [-] reichardt|2 years ago|reply
[+] [-] olokobayusuf|2 years ago|reply
[+] [-] mentos|2 years ago|reply
I’ve got a big empty studio with a bed and couch I’ve already purchased but trying to figure out what to fill in for all the other gaps. Coffee table, media console, tv or UST projector, bar or bookshelf or desk.
Would be nice if there was a way to populate it with items/products that can be purchased and aren’t purely conceptual.
[+] [-] olokobayusuf|2 years ago|reply
Exciting times ahead.
[+] [-] mft_|2 years ago|reply
(No affiliation!)
[+] [-] RockRobotRock|2 years ago|reply
[+] [-] sci_prog|2 years ago|reply
[+] [-] thih9|2 years ago|reply
[+] [-] qingcharles|2 years ago|reply
I have images of decks and porches that need staging for the construction company's web site.
[+] [-] billconan|2 years ago|reply
[+] [-] olokobayusuf|2 years ago|reply
[+] [-] bsenftner|2 years ago|reply
[+] [-] olokobayusuf|2 years ago|reply
[+] [-] linsomniac|2 years ago|reply
[+] [-] olokobayusuf|2 years ago|reply
[+] [-] idank|2 years ago|reply
[+] [-] aroopchandra|2 years ago|reply
[+] [-] llwj|2 years ago|reply
[+] [-] olokobayusuf|2 years ago|reply
[+] [-] pedalpete|2 years ago|reply
[+] [-] olokobayusuf|2 years ago|reply
[+] [-] jayd16|2 years ago|reply
[+] [-] sourabh03agr|2 years ago|reply
[+] [-] aantix|2 years ago|reply
[+] [-] olokobayusuf|2 years ago|reply
[+] [-] tinytera|2 years ago|reply
[+] [-] olokobayusuf|2 years ago|reply
[+] [-] kderbyma|2 years ago|reply
[+] [-] doix|2 years ago|reply
[+] [-] olokobayusuf|2 years ago|reply
[+] [-] init2null|2 years ago|reply
[+] [-] ralfhn|2 years ago|reply
[+] [-] olokobayusuf|2 years ago|reply
[+] [-] philipov|2 years ago|reply
[+] [-] olokobayusuf|2 years ago|reply
[+] [-] blovescoffee|2 years ago|reply
[+] [-] olokobayusuf|2 years ago|reply
So it works by trying to estimate a 3D 'room' that matches your image. Everything from the geometry, to the light fixtures, to the windows. It's heavily inspired by how humans (weird to contrast 'human' vs. AI work) do image/video compositing.
TL;DR: Image in, 3D scene out.
[+] [-] artursapek|2 years ago|reply
[+] [-] olokobayusuf|2 years ago|reply
[+] [-] moritonal|2 years ago|reply
https://investor.wayfair.com/news/news-details/2023/Wayfair-...
[+] [-] olokobayusuf|2 years ago|reply