top | item 15715322

Southwest Airlines is intimidating our startup

69 points| crobertsbmw | 8 years ago |swmonkey.com | reply

72 comments

order
[+] ars|8 years ago|reply
Calling it Southwest monkey is clearly a mistake.

You have little argument there, and you should change that because to a 3rd party it could look like a service offered by Southwest itself.

The part about scraping, etc, is more complicated and you may want to fight that.

See: http://blog.icreon.us/advise/web-scraping-legality and https://arstechnica.com/tech-policy/2017/08/court-rejects-li... as two quick examples I found on previous cases that have gone to court.

[+] romwell|8 years ago|reply
Really? It's bad enough when common words get trademarked, but they didn't even call themselves "Southwest Monkey". They called it "SWMonkey".

Without extra context (knowing what the website is for by reading the description), I'd have no clue that "swmonkey" has anything to do with airlines.

Anyhow, that's a separate issue. Southwest was clearly less concerned about trademark infringement than they were with the business model. The contents of the email clearly indicate that they want to disallow scraping with their ToS, and want to enforce that.

[+] karljtaylor|8 years ago|reply
I've never understood why people do that. It's not as though you'd have much more work on your hands with a "safe name" from a promotions standpoint.
[+] mirimir|8 years ago|reply
Craigslist has litigated the scraping issue heavily, and precedents are bad for SWMonkey.

https://newmedialaw.proskauer.com/2017/04/17/craigslist-garn...

Edit: But the LinkedIn case seems favorable: https://arstechnica.com/tech-policy/2017/08/court-rejects-li...

[+] nslocum|8 years ago|reply
I know of a company that scrapes craigslist in bulk. The owner insisted that he's had no problems. Importantly, his company is not based in the US. Maybe he just ignores their c&d letters.
[+] mapgrep|8 years ago|reply
"Everyone has a right to access public data. And we believe that whether you are accessing that data by typing in a URL in a web browser, through a CURL request, an RSS feed, a cached copy, or having someone read it to you aloud, does not change your right to access public data. "

I am not a lawyer and this is not legal advice, but I believe this is, sadly, dead wrong. The infamous Computer Fraud and Abuse Act contains a provision barring not just unauthorized access to computer systems but also accessing such systems in a manner that exceeds authorized access. In other words, if you break terms of service on a website, you may be in violation of the CFAA.

The ACLU last year filed suit to overturn this provision of CFAA, on the grounds that it chills research into civil rights violations, as well as academic research and journalism. https://www.aclu.org/cases/sandvig-v-sessions-challenge-cfaa...

[+] fjabre|8 years ago|reply
The law is often wrong and written by the more fortunate in society.

The data is publicly available. The reasons this should not be an issue are self evident.

Google scrapes trillions of sites every second of every day. Where's the outrage in that sir? Or the legalities. Oh right the law doesn't apply to them. Just small indie devs.

I don't hide behind legal speak and lawyers. I stand behind the truth of the matter. I'd say any legal argument against non-malicious scraping is dead wrong on moral and ethical grounds. Lawyers and powerful corporations will always try to stamp out the little guy to protect their precious trademark or data because their intellects are too dull and mediocre to compete with new entrants or innovations, so they call and cry about it to their lawyer instead. It's easier.

[+] noer|8 years ago|reply
So, the Southwest TOS state that you can't scrape or programmatically access their data. Isn't that the end of the argument unless they find a way to manually access it for customers?
[+] godelski|8 years ago|reply
I would think so. That's why you don't see SW in Google flights or any other program. SW wants you to only get it from their website. I'd imagine if there was more to the discussion Google would fight it.
[+] revelation|8 years ago|reply
Is visiting a website now some sort of contract? I have no contractual relationship with Southwest, what do I care for their TOS?

In the EU all TOS are basically just a reiteration of standard consumer law, you can put into yours whatever you want but none of that nonsense survives a legal challenge.

[+] petercooper|8 years ago|reply
Given they're charging for this service, maybe there's a way to make it work economically by having low cost workers (e.g. Mechanical Turk or even a bank of workers in certain countries) perform the checks manually (you could even create a browser extension that automates the form filling, but leaves a human to push the button and parse the results). At least by offering it as a paid service, they have some sums they can run to see if manual would work.
[+] lurker12390879|8 years ago|reply
Set it up so the user's browser scrapes it when a button is pressed, and send a reminder every X days as chosen by the user....

User comes to your site, clicks a button, and Southwest's page loads in an iframe, where js does the lifting.

[+] virtualwhys|8 years ago|reply
My brother-in-law sunk a million dollars of his own money into a similar real-time airline pricing service, all of which was thrown away when his now defunct company got banned for scraping.

There are only a few blessed companies that are allowed to scrape airline data (not surprisingly, big players in the market). If you haven't been granted permission and don't comply with their cease and desist you'll be sued and/or have your scraper IPs blacklisted.

It's a tough business to get into.

[+] dstaten|8 years ago|reply
"The problem I see with their argument is that they are making this information public."

I love this idea! However, I think that unfortunately this argument doesn't hold up, and wouldn't in court. The information is available via their site, but that does not make it public.

[+] fjabre|8 years ago|reply
Says who? Them? Right.

If the info is on the web it's available to be consumed by humans or robots in a polite manner.

Data wants to be free.

Them trying to enforce their terms is another matter. Maybe it would hold up in court. Stupider things have. I mean they should start telling us how to breathe air next. Because ya know if it's in your terms then it must be legally binding.

[+] amorphid|8 years ago|reply
> The information is available via their site, but that does not make it public.

Could one offer fair information if there's no reference to Southwest? Is having a website full of listings like "flightno: 123, flight_datetime: 2018-03-04-1045, price: 234.56, price_datetime: 2017-11-15-1130" something Southwest could successfully block?

[+] oh_sigh|8 years ago|reply
If the information is available to any member of the public that requests it with no conditions at all, why is the information not considered public?
[+] willyyr|8 years ago|reply
Might be worth to check out the recent LinkedIn vs. startup scraping case: https://www.reuters.com/article/us-microsoft-linkedin-ruling...
[+] parenthephobia|8 years ago|reply
This is mis-reported. This isn't a general ruling on whether scraping sites in violation of the explicit instructions of the site owners is allowed.

The judge granted an injunction prior to the trial proper, permitting HiQ to continue scraping LinkedIn in the lead up to the trial. He did this because HiQ credibly argued that if it couldn't scrape LinkedIn it would go out of business before the court had even determined whether what it was doing was legal. These sorts of injunctions are a procedural matter and quite normal.

https://en.wikipedia.org/wiki/Preliminary_injunction has more.

[+] sp332|8 years ago|reply
it was a public domain image, licensed under creative commons.

Unless it was CC0 (which is not what was linked), Creative Commons licenses are not public domain licenses. Someone still holds the copyright and you have to abide by the license terms. Secondly, CC and "public domain" are about copyright, not trademarks. It's totally possible to infringe someone's trademark even with a public domain image.

[+] fjabre|8 years ago|reply
Good for you. I completely agree with your sentiment re: scraping. It's ok if Google or another big company does it but god help you if you do as an indie dev. Thank you for sharing this. I commend you.

I've scraped millions of records from all kinds of companies big and small, politely of course, and I will definitely continue to do so at my discretion for ideas. In your case I would've scraped southwest without hesitation.

I'd make sure to distance yourself from the trademark as much as possible. Maybe even remove "sw" from the domain name but otherwise I don't see how they have a case.

TOS trying to enforce anti-scraping measures is a joke and makes a mockery of the judicial system.

[+] j45|8 years ago|reply
Flightmonkey might be a better name. Apps like Hopper appear to be monitoring flight prices just fine, not sure if they include SW, though.

Doing flight searches via other travel platforms/apis may also be legal.

[+] pwinnski|8 years ago|reply
"And we believe that whether you are accessing that data by typing in a URL in a web browser, through a CURL request, an RSS feed, a cached copy, or having someone read it to you aloud, does not change your right to access public data."

I absolutely do have the right to access the data. However, it's not clear why you should have the right to access the data and then monetize redistributing it to me.

[+] praneshp|8 years ago|reply
The landing page (https://www.swmonkey.com/) still has the workd "Southwest" used several times, and the picture of what is obviously a Southwest plane. I wish common sense prevails and this (obviously very useful) service doesn't die because the founder can't control his ego.
[+] crobertsbmw|8 years ago|reply
Founder here: There is one case where I use the name SouthwestMonkey instead of swmonkey. I realize that that was a bad naming decision. All the other references to southwest are to the Airline and not this service.
[+] mankash666|8 years ago|reply
Maybe you can crowdsource pricing info. That way, you're not scraping, and users are voluntarily sharing what they've searched.
[+] justgottasay3|8 years ago|reply
That would be one way to comply with the letter of the TOS. There will be some issue of getting enough traction from users that obey the TOS to make the site useful, though.

I think crowdsourcing would open up SWmonkey to 'attack' from SW, where SW would flood the channel with conflicting info at a rate that would make the SWmonkey site useless to legitimate customers.

Also, most of the companies that offer fare comparisons do not show SW flights... Some of those companies have the tech chops to implement 'fixes' that would comply with the SW site TOS and the massive legal departments to defend their position/approach in any and all courts in the world.

[+] jerf|8 years ago|reply
That's got a "boil the ocean" problem. It's probably easier to imagine it on a per-flight basis; how many people in a set of people who may look at a given flight are going to be swmonkey users? You have to get that number above a certain threshold before that would work, but it's hard to get that number above a certain threshold because nobody will want to use the service until you're there.

Plus, with the way airline pricing works now, just because Alice sees one price doesn't mean Bob will see it ten minutes later. The faster the prices change, the higher the percentage of people you need to have crowdsourcing.

And I've just assumed that "crowdsourcing" looks like "install a browser extension". If it involves "typing numbers into another site" you can expect a participation rate indistinguishable from 0%. I've also assumed Southwest remains oblivious to this and never takes any actions to counter what they consider an undesirable use, which is also unrealistic.

[+] bikamonki|8 years ago|reply
Make it an app not a website. Users can download and install it, enter flight data and click to start monitoring.
[+] dawnerd|8 years ago|reply
And support more airlines and not call yourself southwest monkey. Plenty of airlines have policies about lower prices so there is a market for this.
[+] Danihan|8 years ago|reply
And then SW has it removed from the app stores in about two seconds...
[+] dzdt|8 years ago|reply
Does putting a human in the loop make scraping okay? Can there be a button on the web page : "please click here to run a price check for one of your fellow travelers" that initiates a web page being fetched, attached to a round-robin queue of fares that are being monitored?
[+] crobertsbmw|8 years ago|reply
I don't think so. All they would have to do is change their terms in conditions to prohibit round robin schemes and then send me a new cease and desist...
[+] anweshi|8 years ago|reply
Google flights doesnt show SW pricing. That itself should have been a red flag for the creators.
[+] bob_theslob646|8 years ago|reply
Seriously! It is almost as if they thought there was no reason why it didn't appear on Google....

Remember ignorance is bliss

[+] lanevorockz|8 years ago|reply
We are giving up on free speech to avoid people been offended, of course lawyers in big companies will find a way to use that to their advantage.
[+] fjabre|8 years ago|reply
Lost faith in the crowd here after reading these comments, against web scraping. Isn't this HACKERnews!? This is the same witch hunt mentality that ultimately lead to the prosecution of Aaron Swartz and look how that turned out.

The anti-scrapers: You should all be ashamed of yourselves. Where's that spirit of innovation and naughty-ness that PG always talks about in his posts?