How FriendFeed uses MySQL to store schema-less data

[+] joshu|16 years ago|reply

I probably should write up the (very similar, but with a few significant differences) my design for the never-executed Delicious backend datastore.

[+] eserorg|16 years ago|reply

That would be very interesting to read.

We're looking at implementing a tagging system for navigating through a large proprietary datastore of oil and gas well data.

The problem we're running into is scaling conjunction queries without blowing-out our hardware budget -- 100's of tags per item with tens of millions of items, updated daily.

[+] mnmdev|16 years ago|reply

I had seen your comment on the original article and was very interested. At the time I checked your site just in case you had written on the subject. I think this would be very useful.

[+] bravura|16 years ago|reply

Please, I'd find that very interesting.

[+] hackernews|16 years ago|reply

this was discussed 413,442 posts back

http://apps.ycombinator.com/item?id=496946

[+] dpapathanasiou|16 years ago|reply

That strange feeling you sometimes get: http://www.youtube.com/watch?v=G2eUopy9sd8

[+] JoeAltmaier|16 years ago|reply

This matches all my intuitions about how I was going to do this; now I have a cookbook (several recipies!)! This is why Hacker News is so valuable. Nothing beats experienced advisors.

[+] SingAlong|16 years ago|reply

I tried this method for my app. But it turned out that I needed a lot of fields to be in their own columns since i would have to use them with queries in the WHERE clause or ORDER clause.

So be careful while choosing the columns you would like to serialize or json-ify and dump.

Just my 2c.

[+] technoweenie|16 years ago|reply

Part of this article talks about the index tables they use. The rationale for having separate index tables is they can create or drop them without locking tables or anything like that. It takes a little more manual work to keep things updated though.

I too use a hybrid approach on my site. Only columns that aren't specifically referenced in indexes aren't kept in the serialized attribute hash.

[+] cnu|16 years ago|reply

Check out a open sourced version written on top of SQLAlchemy. http://github.com/bickfordb/waffle

13 comments