wingman-jr | 1 month ago | on: The €10 Mirror: Why Enterprise Security Looks Like a Kid's Toy
wingman-jr's comments
wingman-jr | 1 month ago | on: If you put Apple icons in reverse it looks like someone getting good at design
wingman-jr | 4 months ago | on: HTTPS by default
wingman-jr | 5 months ago | on: T-Mobile Will Soon Begin LTE Phase-Out
wingman-jr | 1 year ago | on: Ask HN: How can I learn about manufacturing?
wingman-jr | 1 year ago | on: How to trade software for small money?
wingman-jr | 2 years ago | on: .NET Smart Components
wingman-jr | 2 years ago | on: Ask HN: SWEs who switched to AI/ML, how is it going?
wingman-jr | 2 years ago | on: 86-DOS Version 0.11 Found
wingman-jr | 2 years ago | on: Why Is the Front End Stack So Complicated?
wingman-jr | 2 years ago | on: Why Is the Front End Stack So Complicated?
wingman-jr | 2 years ago | on: Ask HN: How do your ML teams version datasets and models?
wingman-jr | 2 years ago | on: I want to talk about WebGPU
wingman-jr | 4 years ago | on: Show HN: An AI program to check videos for NSFW content
Regarding the current state of tech: I agree the tech still has quite a ways to go. I think one of the most interesting aspects here is how e.g. NSFW.js can get extremely high accuracy - but not necessarily perform better in the real world. I think it speaks in part to the nature of how CNN's work, the nature of the data, and the difficulty of the problem. Still, having seen how incredibly good "AI" has gotten in the last decade, I have quite a bit of hope here.
Regarding putting it on a server: that is indeed a fair question, but my desire is to keep the scanning on the client side for the user. In fact, it was actually the confluence of Firefox's webRequest response filtering (which is why I didn't make a Chrome version) and Tensorflow.js that allowed me to move from dream to reality as I had been waiting prior to that time. I can't afford server infrastructure if the user base grows, and people don't want to route all their pictures to me. So I guess I see the current way it works as a bonus, not a flaw - but it DOES impact performance, certainly.
Regarding data collection with respect to server - yes, this is something I've contemplated (there's a GitHub issue if you're curious). There are, however, two things that I've long mulled over: privacy and dark psychological patterns. Let me explain a bit. On the privacy front - it is not likely legal for a user to share the image data directly due to copyright, so they need to share by URL. This can have many issues when considering e.g. authenticated services, but one big one also is that the URL may have relatively sensitive user-identifying information buried in its path. I can try to be careful here but this absolutely precludes sharing this type of URL data as an open dataset. On the psychological dark patterns front - while I'm fine with folks wanting to submit false positives, I think there's a very real chance some will want to go flag all the images they can find that are false negatives (e.g. porn). I don't think that type of submission is particularly good for their mental health or mine. So, in general, I think user image feedback is something that would be quite powerful but needs a lot of care in how it would be approached.
Regarding the UX - thanks! And you're welcome to try the model as well - I've tried to include enough detail and data to allow others to integrate as they wish: https://github.com/wingman-jr-addon/model/tree/master/sqrxr_... Also, let us know how things go if you try out Darknet.
Good luck!
wingman-jr | 4 years ago | on: Introduction to the A* Algorithm (2014)
wingman-jr | 4 years ago | on: Show HN: An AI program to check videos for NSFW content
wingman-jr | 4 years ago | on: Show HN: An AI program to check videos for NSFW content
- Babies (https://github.com/wingman-jr-addon/wingman_jr/issues/22)
- Beach volleyball (but this definitely has SFW and NSFW variants, based on a somewhat subjective line)
- Athletes in general. The model particularly thought some American football players were NSFW for a long time.
- Swimming
- Yoga - again, most SFW and some NSFW here but it still struggles
- Wrestling was a tough one for sure
- Pokemon
While indeed tough, I've seen definite progress. So it's not just a matter of tech, but also of considering the human element - the state of the art may not be up to the challenge of perfection, but it is definitely up to a point of true utility for some use cases. I'm happy about that.As a note, it uses an EfficientNet Lite L0 backbone - I'm a bit limited in what type of scanning I can perform in a sufficiently speedy manner.
I also agree on the context for sure - one reason I haven't tried switching to an object detection method (and that I don't rely heavily on truly random crops) is that the focus of the image is highly important for the NSFW-ness in some cases. True, two images may contain the same content ... but one is far worse than the other. The nature of CNN's still has some of this location-invariance baked in, but I don't want to exacerbate it.
One challenge I think the OP may run into here that may also not be immediately obvious is that accuracy on image stills does not translate that well to video. I have basic video support in my addon, and while I knew there would be some differences, I was surprised at how many discrepancies there really are. As two examples:
- Images in video are often blurrier. In true still images, there is a somewhat higher prior involved with amateur NSFW content and blurriness. This can be a source of false positives.
- The opposite of the note above about focus. Taking stills of moving images will have many transitory frames that seem inappropriate on their own because it seems as if they are focusing on something when in reality the camera is just panning - obvious to the human, less so to the model trained on stills.
At any rate, given how well your list of edge cases coincided with failures I've grappled with, I'd be interested to see how well you think my addon stacks up for still images when set to stay in "normal" mode. I'd love to hear any feedback you have via GitHub so I can make it better.wingman-jr | 4 years ago
wingman-jr | 4 years ago | on: Ask HN: I love coding, but I hate web development. How can I “jump ship”?
wingman-jr | 4 years ago | on: WD My Book users wake up to find their data deleted
However, that doesn't fix the actual vulnerability itself. Anybody else affected have a suggestion for how to e.g. upgrade the relevant packages to make this secure?