Not bad, but I think it would be more useful if I could submit an image and have the engine give me all the facts it could dig up about it, based on its context in other pages, geo tags and camera type (if available), etc.
I think we're going to see some very interesting developments along these lines very soon. Scary stuff too. Imagine submitting a picture of yourself and finding out what the internet knows about you based on your physical appearance. Better keep those Facebook profiles private, folks! More than that, you'll have to convince your friends to keep their profiles private if they have pics of you as well!
<tangent> This is what is rather frightening about the next web; even if you want to remain anonymous, you're going to have to do battle with all the other folks who are more than happy to post and tag pictures of you for the world to see (with good -natured intentions, I might add). Remember that embarrassing moment at that party where you had a little too much to drink? Oh, you were too drunk to recall? Well, it's on somebody's public Facebook profile now. With your name on it. And if I am your employer, what's to stop me from taking your badge photo and plugging it into a service to pull down other pictures of you from the cloud? :O </tangent>
Anyway, back to the matter at hand! I do see your service as being particularly valuable to IP holders who want to know who is displaying their copyrighted images or logos without authorization. If your site were comprehensive enough, you could probably go freemium and become a paid tattle-tale. Take that a step further and "For a nominal fee, you can click here to have our partners at LegalZoom.com send a takedown notice."
You can use http://imageheader.com/alpha , which reads image file and displays all information about the image including geolocation tags. This is alpha version
Their crawler could use some work. (I uploaded my PR screenshot of my software. They flagged several copies which are domains 301 redirected to my homepage.)
The algorithm, however, is beyond awesome. They found half a dozen instances of my screenshot on the Internet, including my site, some download sites, and a Chinese pirate or two who had gone to the trouble of watermarking my image.
TinEye was created by Idée Inc. Idée develops advanced
image identification and visual search software for photo
wire agencies, stock photography firms, entertainment media
companies and some of the world's leading imaging firms
including Adobe Systems Inc.
In other words, yes they intend for it to be used by content owners to find unauthorized use of their IP on the web. On the other hand, they claim to respect robots.txt and give their crawler name on the same page.
I have an idea for how to do a better reverse image search engine. The problem with this one is that it only matches exact images. But what if it was possible to match against any feature within the source image, rather than the exact, entire source image you search for?
For each image in the index, break it up into 4x4 tiles, then store a hash code for each tile. Then repeat the process, but offset the boundary of each tile by 1px along the X axis. Repeat 2 more times. Then, for each offset along the X axis, offset down along the Y axis. So you store 16 hashes per 4x4 pixel area.
Now, when someone searches for an image, repeat that hashing algorithm for the source image. The results page then returns any image that contains a 4x4 tile that is also contained in the source image, ranked by the number of tiles within the image that is common between the source and result image.
The end result is that you can see how the features within an image are used in other images -- so if someone takes the red stapler from Office Space ( http://www.yunasville.com/img/102005/milton.jpg ) and puts it into a different image, and you search for that red stapler, the results page will still return the photoshopped image, because it'll match the 4x4 tiles on the stapler in both images.
I've explained this in a convoluted way, but hopefully I've communicated the essence of the idea.
On one hand, there will be more results to filter through, and it's more computationally expensive. But that's fine, the image results are still ranked effectively. On the other hand, it's more computationally expensive.
From the site after a 0 match result: "TinEye looks for the specific image you uploaded, not the content of the image. TinEye cannot identify people or objects in an image."
So it's not what some of us might have been afraid of.
Nor me when I found it a couple of months ago and I needed such a service. I wanted to source the true manufacturer of a product and all I was finding at the time were a lot of traders who were using the same official product photos. I think it will improve though and the web needs it.
I remember thinking about a service like this but for audio - for example you post a link to youtube video or some similar service and it would match audio in video with a song name. That would be pretty cool and useful.
As a photographer I must say that istockphoto and the non licensed (the proper term escapes me) images on flickr give bloggers and web designers no excuses.
Content owners pay a lot of money to protect their content. They pay money to lawyers, lobbyists, and companies like Media Sentry to snoop in on P2P traffic.
It's not a stretch to say they will pay to find stolen images. I'd venture they'd pay pretty well too.
[+] [-] ruby_roo|17 years ago|reply
I think we're going to see some very interesting developments along these lines very soon. Scary stuff too. Imagine submitting a picture of yourself and finding out what the internet knows about you based on your physical appearance. Better keep those Facebook profiles private, folks! More than that, you'll have to convince your friends to keep their profiles private if they have pics of you as well!
<tangent> This is what is rather frightening about the next web; even if you want to remain anonymous, you're going to have to do battle with all the other folks who are more than happy to post and tag pictures of you for the world to see (with good -natured intentions, I might add). Remember that embarrassing moment at that party where you had a little too much to drink? Oh, you were too drunk to recall? Well, it's on somebody's public Facebook profile now. With your name on it. And if I am your employer, what's to stop me from taking your badge photo and plugging it into a service to pull down other pictures of you from the cloud? :O </tangent>
Anyway, back to the matter at hand! I do see your service as being particularly valuable to IP holders who want to know who is displaying their copyrighted images or logos without authorization. If your site were comprehensive enough, you could probably go freemium and become a paid tattle-tale. Take that a step further and "For a nominal fee, you can click here to have our partners at LegalZoom.com send a takedown notice."
:)
[+] [-] wave|17 years ago|reply
Example:
http://imageheader.com/alpha/index.php?url=http://farm4.stat...
[+] [-] FlorinAndrei|17 years ago|reply
[+] [-] froo|17 years ago|reply
Actually I think this sort of technology would be just as useful for the dating sites to help weed out potential fake profiles.
[+] [-] blasdel|17 years ago|reply
[+] [-] cousin_it|17 years ago|reply
[+] [-] palish|17 years ago|reply
[+] [-] cdr|17 years ago|reply
[+] [-] patio11|17 years ago|reply
The algorithm, however, is beyond awesome. They found half a dozen instances of my screenshot on the Internet, including my site, some download sites, and a Chinese pirate or two who had gone to the trouble of watermarking my image.
I think Getty just had kittens.
[+] [-] shader|17 years ago|reply
For some cool examples, check out http://tineye.com/cool_searches
Also, I recommend that you click the "compare images link" under each result image after you perform a search, to see which part it matched.
I've used tineye several times, and it's found the sources of heavily photoshopped images before. They've done a great job.
[+] [-] aminuit|17 years ago|reply
[+] [-] ambition|17 years ago|reply
[+] [-] entelarust|17 years ago|reply
if you browse the results, it even finds the image's use in formatted/manipulated graphics
[+] [-] anamax|17 years ago|reply
[+] [-] pixcavator|17 years ago|reply
[+] [-] andrewl-hn|17 years ago|reply
[+] [-] palish|17 years ago|reply
For each image in the index, break it up into 4x4 tiles, then store a hash code for each tile. Then repeat the process, but offset the boundary of each tile by 1px along the X axis. Repeat 2 more times. Then, for each offset along the X axis, offset down along the Y axis. So you store 16 hashes per 4x4 pixel area.
Now, when someone searches for an image, repeat that hashing algorithm for the source image. The results page then returns any image that contains a 4x4 tile that is also contained in the source image, ranked by the number of tiles within the image that is common between the source and result image.
The end result is that you can see how the features within an image are used in other images -- so if someone takes the red stapler from Office Space ( http://www.yunasville.com/img/102005/milton.jpg ) and puts it into a different image, and you search for that red stapler, the results page will still return the photoshopped image, because it'll match the 4x4 tiles on the stapler in both images.
I've explained this in a convoluted way, but hopefully I've communicated the essence of the idea.
On one hand, there will be more results to filter through, and it's more computationally expensive. But that's fine, the image results are still ranked effectively. On the other hand, it's more computationally expensive.
[+] [-] habibur|17 years ago|reply
So it's not what some of us might have been afraid of.
[+] [-] staunch|17 years ago|reply
Very impressive work. I can't think of any real need for it myself, but it is cool.
[+] [-] varun|17 years ago|reply
[+] [-] motoko|17 years ago|reply
[+] [-] dc2k08|17 years ago|reply
[+] [-] coglethorpe|17 years ago|reply
[+] [-] Keyframe|17 years ago|reply
[+] [-] jpwagner|17 years ago|reply
[+] [-] quizbiz|17 years ago|reply
[+] [-] globalrev|17 years ago|reply
But will people pay for such a service?
There are just so many companies starting with seemingly no way to make money.
[+] [-] mustpax|17 years ago|reply
It's not a stretch to say they will pay to find stolen images. I'd venture they'd pay pretty well too.
[+] [-] ambition|17 years ago|reply