Imagine being called John Graham-Cumming. Long, long ago Google didn't understand that "Cumming" was a name. Google myself, get served ads for adult web sites.
And Eudora's Mood Watch feature would flag every single email I sent as offensive.
There's an old joke about a girl named Megan E. Cummings, who successfully petitioned for a change to her university e-mail, which had been auto-generated according to the scheme `substr($LASTNAME, 6).$FIRSTINIT.$MIDINIT`.
Try keeping up with the journals as a chemist while on holiday behind an overeager webproxy. You are told that the subdiscipline of analytical chemistry is out of bounds.
But that's a feature. The voting public sees that you are trying hard and failing, that's somehow considered better than shaking your head at the intractable problem.
In my former life as a mathematician, I worked on analytic combinatorics. Mathematicians aren't quite as aggressive about abbreviating as chemists, so I never saw the abbreviation "anal. comb." in the wild, but I always expected to.
I've run into this problem myself when parsing recipes for food allergies . Doughnuts has the word nuts in it but doesn't always contain nuts as an ingredient .
This happened at Medium [0] because they hash paragraphs to a 4-digit hexadecimal string, and ad blockers would hide things like "#ad01", "#ad02", etc.
Same here for our Hackespace a year or so ago - our sponsor logos weren't showing; we had to change the CSS class to some arbitrary meaningless phrase.
This problem could be solved by defining a logical rule (most probably through a regular expression) that would only filter the bad word when present as a single word.
I'm amazed how rarely this simple system is used. Instead you end up with monstrosities such as the power stars chat that mangles most words into unreadable mess of .
When I worked for a company that made label printers we had a potential customer who wanted us to print labels with human readable and barcode fields with 4 random letters and 4 random digits but did not want the letters to spell any obscene words. We asked for a list of words to ban but they declined to provide such a list. We did not get the contract.
Note that the problem of words being misunderstood when lacking context is not limited to computers. My father - a chemistry professor - was at a conference a few years ago about Free Radicals when he was approached by a member of the public who wanted to know if he could participate...
Not quite as funny, but I used to work for a company that had the word microwave in its name - we made radar components and such. We once were approached by someone who wanted us to repair their microwave oven. Our building had no obvious sign and didn't look like an appliance repair shop, so I don't know how they found us.
I'm not sure if it's still the case, but it used to not be possible to trade certain Pokémon over the global trade system with their default name due to a filter like this.
I believe Nosepass and Cofagrigus were two of the affected.
[+] [-] jgrahamc|9 years ago|reply
And Eudora's Mood Watch feature would flag every single email I sent as offensive.
[+] [-] jwdunne|9 years ago|reply
Somehow delivery of order confirmation emails have never been an issue!
Edit: eeek I don't order from there, I just work with the website's host!
[+] [-] JadeNB|9 years ago|reply
[+] [-] cschmidt|9 years ago|reply
[+] [-] Toenex|9 years ago|reply
[+] [-] colemannugent|9 years ago|reply
Why Web Filters Don't Work: Penistone and the Scunthorpe Problem - https://www.youtube.com/watch?v=CcZdwX4noCE
It's well done, like the rest of his content.
[+] [-] Zuider|9 years ago|reply
[+] [-] HarryHirsch|9 years ago|reply
But that's a feature. The voting public sees that you are trying hard and failing, that's somehow considered better than shaking your head at the intractable problem.
[+] [-] madcaptenor|9 years ago|reply
[+] [-] erbo|9 years ago|reply
[+] [-] zitterbewegung|9 years ago|reply
[+] [-] cwmma|9 years ago|reply
[+] [-] prezjordan|9 years ago|reply
[0]: https://medium.engineering/the-unluckiest-paragraphs-751dd36...
[+] [-] TeMPOraL|9 years ago|reply
[+] [-] lb1lf|9 years ago|reply
Someone in the IT department figured it was an excellent idea to host all student accounts on the stud.ntnu.no subdomain.
We got a few odd bounces.
[+] [-] kbenson|9 years ago|reply
[+] [-] MattConfluence|9 years ago|reply
[+] [-] jap|9 years ago|reply
[+] [-] rjeli|9 years ago|reply
[+] [-] mikeash|9 years ago|reply
[+] [-] minimaxir|9 years ago|reply
[+] [-] TeMPOraL|9 years ago|reply
[+] [-] jfoutz|9 years ago|reply
> It also blocked e-mails sent in Welsh because it did not recognize the language.
With my (very) limited exposure to Welsh, i kinda get that it would give spam filters fits.
[+] [-] joss82|9 years ago|reply
I'm amazed how rarely this simple system is used. Instead you end up with monstrosities such as the power stars chat that mangles most words into unreadable mess of .
Could be a fun game though. Guess the words!
ertion
Weight and m
[+] [-] BillBohan|9 years ago|reply
[+] [-] davidddavidson|9 years ago|reply
[+] [-] cperciva|9 years ago|reply
[+] [-] jpindar|9 years ago|reply
[+] [-] InitialLastName|9 years ago|reply
[+] [-] ewrong|9 years ago|reply
[+] [-] grkvlt|9 years ago|reply
> Q: Can I provide my own wood? A: In most cases we can handle your wood.
I'm finding it hard to decide how intentional this is...?
[+] [-] js2|9 years ago|reply
[+] [-] kyle-rb|9 years ago|reply
I believe Nosepass and Cofagrigus were two of the affected.
[+] [-] unknown|9 years ago|reply
[deleted]
[+] [-] unknown|9 years ago|reply
[deleted]
[+] [-] UncleSlacky|9 years ago|reply