top | item 12387307

(no title)

ivanca | 9 years ago

You were claiming "they could just as well package into, say, an install CD image" which is likely not going to happen, so he was just giving an example.

The perfect spam filter Gmail has its probably based on data and ML analysis, not a simple software you can put on a CD and install on some little server; so even if someone who works there wanted to help you they probably can't do it.

discuss

order

stonogo|9 years ago

GMail's "perfect spam filter" is so bad that it is the reason my institution moved away from Google services. It's also the main reason so many people consider it 'impossible' to host your own email.

It's anything but perfect. It's a black box, with no user-serviceable parts inside. Completely useless for any organization with needs that deviate from Joe User.

newjersey|9 years ago

How would you implement a spam filter?

You can sort of turn off spam filtering on incoming emails but I don't most people or organizations would want that.

> Completely useless for any organization with needs that deviate from Joe User.

Maybe but being slightly over aggressive means a world in terms of user happiness. Think about email before Gmail. How much junk did you see in your inbox? There is a lot LESS today. Joe User would be very happy if they dried to think about it for a second.

Joof|9 years ago

We could probably build a distributed version of a spam filter by using a fancy online (single sample batch) training algorithm and combining models. I'm tired, so that exact solution may be infeasible, but I'll bet something similar is.

zAy0LfpBZLC8mAC|9 years ago

> You were claiming "they could just as well package into, say, an install CD image" which is likely not going to happen, so he was just giving an example.

What's the point of pointing out the one example among millions of email server setups that is most likely to not be published? Especially so, given that that would be about the most useless setup to replicate, as noone needs the scalability of gmail for their own setup, and thus the complexity that would come with it.

> The perfect spam filter Gmail has its probably based on data and ML analysis, not a simple software you can put on a CD and install on some little server; so even if someone who works there wanted to help you they probably can't do it.

If you think that gmail's spam filter is perfect, I have a very simple even more perfect spam filter: Just throw away all emails. Gmail has massive false positive rates, that's not a perfect filter, that's just a filter that throws away a lot of emails.

If anything, the control that google has over which emails it arbitrarily labels as spam is completely unacceptable, especially so given that they don't even accept liability for incorrectly filtered emails.