(no title)
liuru | 4 years ago
I hope you all enjoy playing with the new and improved generator! We've been hard at work improving the model quality since the last time the site was posted.[1]
As both a professional fantasy illustrator & software engineer, I find the concept of AI creativity fascinating. On one hand, I know that mathematically an AI can only hallucinate images that fit within the distribution of things it has seen. But from the artist's perspective, the model's ability to blend two existing styles into something distinctly new is incredible (not to mention commercially useful!).
Anyways, happy to answer any questions, thoughts, or concerns!
---
wodenokoto|4 years ago
Can you talk a little about team size, work process, funding and revenue stream? I think the effort required for such an undertaking is vastly underestimated by readers.
Cixelyn|4 years ago
> I think the effort required for such an undertaking is vastly underestimated by readers.
Haha, for sure. Hosting a real-time ML model so people can run sub-second inferences at HN-load scale is definitely nontrivial.
[1] https://arrowmancer.com
hansel_der|4 years ago
same here. what's naive about it?
not to badmouth the undertaking, but wtf is this doing on HN?
2bitencryption|4 years ago
My question is, how do you figure out how to parameterize "Same character, different pose" / "Same character, different eyes" / "Same character, different gender" / etc?
My (super limited) understanding of GANs is that they slowly discover these features over time simply from observation in the data set, and not from any labels.
So how could you make, e.g., a slider for head position, style, pose, etc.? How do you look at the resulting model and figure out "these are the inputs we have to fiddle with to make it use a certain pose"?
You mention it a bit in this section, but I didn't fully understand: "By isolating the vectors that control certain features, we can create results like different pose, same character"
And I assume the same step needs to be done every time the model is retrained or fine-tuned, because possibly the vectors have shifted within the model since they are not fixed by design?
liuru|4 years ago
You can think of it like coordinates on a many-dimensional vector grid.
We craft the functions that will illuminate sets of those points based on a combination of observation, what we know about our model architecture, and how our data is arranged.
And yes, when the model is retrained, we have to discover them again!
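For readers wondering what "isolating a vector" can look like in practice, here is a minimal sketch of one commonly used approach: take latent codes whose outputs have been labeled with and without some attribute, and use the difference of their means as an edit direction. This is not necessarily what the site does. The function names, the toy 8-dimensional latent space, and the synthetic labels are all assumptions made for illustration.

```python
import numpy as np


def attribute_direction(latents, has_attribute):
    """Unit vector from the mean latent without the attribute to the mean with it."""
    latents = np.asarray(latents, dtype=float)
    mask = np.asarray(has_attribute, dtype=bool)
    direction = latents[mask].mean(axis=0) - latents[~mask].mean(axis=0)
    return direction / np.linalg.norm(direction)


def edit_latent(z, direction, strength):
    """Move a latent code along the direction; strength plays the role of a slider."""
    return np.asarray(z, dtype=float) + strength * direction


# Toy stand-in for a real generator's latent space (hypothetical data):
# the "attribute" is planted along axis 2 so the sketch has something to find.
rng = np.random.default_rng(0)
dim = 8
true_axis = np.zeros(dim)
true_axis[2] = 1.0
labels = rng.random(200) < 0.5
latents = rng.normal(size=(200, dim)) + np.outer(labels * 3.0, true_axis)

direction = attribute_direction(latents, labels)

z = rng.normal(size=dim)
z_edited = edit_latent(z, direction, strength=2.0)
```

Feeding `z` and `z_edited` to the same generator would then ideally give the same character with only that attribute changed. Because the direction is estimated from a particular model's latent codes, it has to be estimated again after retraining, which matches the reply above.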
flor1s|4 years ago
thyrox|4 years ago
ridaj|4 years ago
A couple questions:
1) I didn't really understand how you went about identifying which vectors of the latent space stand for various things, like pose or color. Did you train one of the AIs to that effect, or did you manually inspect a bunch of vectors, twiddling through them one by one to see what they did to the outcome?
2) If one were to train an AI to the same level using commodity cloud services, what's the order of magnitude cost that you would pay for the training? More like $100, $1,000, $10,000 or $100,000?
liuru|4 years ago
2) It depends on the quality you are seeking. If you only want one run of a similar, off-the-shelf model, something in the $1,000s is enough. But with the number of iterations you have to run to build your own and improve results, you probably need about $100k.
To tackle this problem, we built our own supercomputer from parts we bought on eBay, though I can't say I recommend that route, because it now lives in our living room.
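As a rough back-of-the-envelope illustration of how a single run can land in the thousands while a full development cycle approaches six figures, here is a small sketch. Every number in it (GPU count, run length, hourly price, number of runs) is a hypothetical placeholder, not a figure from the team or from any cloud provider.

```python
def training_cost_usd(gpus, hours_per_run, usd_per_gpu_hour, runs=1):
    """Total rental cost: GPUs x hours per run x hourly price x number of runs."""
    return gpus * hours_per_run * usd_per_gpu_hour * runs


# Hypothetical setup: 8 rented GPUs, a two-week run, $3 per GPU-hour.
single_run = training_cost_usd(gpus=8, hours_per_run=14 * 24, usd_per_gpu_hour=3.0)

# Building and tuning your own model means repeating that many times (12 assumed here).
many_runs = training_cost_usd(gpus=8, hours_per_run=14 * 24, usd_per_gpu_hour=3.0, runs=12)

print(f"one run: ${single_run:,.0f}, twelve runs: ${many_runs:,.0f}")
```

The point is only that cost scales linearly with the number of experiments, so iteration count dominates the budget far more than any single run does.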
dimgl|4 years ago
Does this mean two weeks of development, or two weeks to generate the images we're seeing? Or did you train the model for two weeks? That point just wasn't exactly clear to me.
liuru|4 years ago
Development took on-and-off roughly 2 years to achieve the quality you see today.
kouteiheika|4 years ago
JetAlone|4 years ago
kregasaurusrex|4 years ago
liuru|4 years ago
We're currently working on the data migration from V1! As long as you are using the same email as you did in 2019, you'll be able to see the image again!
As for a V2 generation: sorry, because the models are different, you'll have to discover a similar image again if you want a V2 version!
rackjack|4 years ago
liuru|4 years ago
There was such popular demand for these "horror" images that we made them part of the generation in V2! If you refresh enough on the webpage, you can find some horrors!
Cthulhu_|4 years ago
I've seen a number of mobile games that just get flooded with characters; this tool looks like it could be used to automate that process. It could be combined with AI-generated character profiles as well, creating an 'infinite' character roster in video games.
Terry_Roll|4 years ago
In humans, things like the pupil can be the give away.
https://www.newscientist.com/article/2289815-ai-can-detect-a...
yccs27|4 years ago
oneoff786|4 years ago
hypertele-Xii|4 years ago
Gigachad|4 years ago
liuru|4 years ago
Like this one by fast.ai!
https://docs.fast.ai/vision.gan.html
Afforess|4 years ago
zozbot234|4 years ago
simonebrunozzi|4 years ago
Is there an email to reach out to you or someone in the team? ($HNusername @ gmail)
Cixelyn|4 years ago
GoblinSlayer|4 years ago
searchableguy|4 years ago
I think I could use this for a project.
liuru|4 years ago
YeGoblynQueenne|4 years ago
>> It is interesting to note that from this process, the AI is not merely learning to copy the works it has seen, but forming high-level (shapes) and low-level (texture) features for constructing original pictures in its own mental representation.
Can you explain what you mean by "mental" representation? Does your system have a mind?
Also, why are you calling it "an AI"? Is it because you think it is an artificial intelligence, say like the robots in science fiction movies? Is it capable of anything other than generating images?
xg15|4 years ago
On each step, high-level parameters are combined with predefined weights to produce a more low-level output.
It seems a similar transformation is going on here, except that the weights and the structure are somehow learned on their own.
jacoblambda|4 years ago
tedmcory77|4 years ago
darkengine|4 years ago
liuru|4 years ago
https://www.thiswaifudoesnotexist.net/
unobatbayar|4 years ago
liuru|4 years ago
https://www.gwern.net/Danbooru2020
Though now we have made our own :)
lynzrand|4 years ago
https://www.gwern.net/Danbooru2020
Bombthecat|4 years ago