top | item 4020769

Scribd says: Beat our programmers at a coding challenge

143 points | jennylin | 13 years ago | coding.scribd.com

107 comments

[+] tazzy531|13 years ago|reply
For anyone that is doing CS at Columbia University or considering CU, if you like these types of coding challenges, definitely look into Professor Ross's Programming and Problem Solving class. It was one of the best CS classes I took at Columbia.

Here is his website: http://www.cs.columbia.edu/~kar/teaching.html

Each semester consists of four different open-ended programming problems like these. You work as a team to compete against other members of the class. There are no tests, and each class is run as an open seminar where people talk about their strategy and implementation and consider the best approach to solving these problems.

This was my favorite from my year: http://www.cs.columbia.edu/~kar/4444f02/node18.html

[+] exim|13 years ago|reply
I wish lecture notes and videos were available for this course...
[+] pcopley|13 years ago|reply
It always blows my mind how many CS academics have these god-awful mid-90s era webpages.
[+] qbrass|13 years ago|reply
On the game rules page: http://www.scribd.com/jobs/botrace_rules

The game starts off with Apples, Bananas and Oranges, but ends with Apples, Bananas and Melons.

Robots are a wise choice for safety's sake when dealing with quantum fruit.

[+] matthiaskramm|13 years ago|reply
Fixed, thanks :) We'll deploy a new page tomorrow.
[+] Kototama|13 years ago|reply
I've got a coding challenge for the Scribd team: make document downloads one click away from the document page, with no mandatory login.
[+] a1k0n|13 years ago|reply
You may need to consider a better way to rank players - percentage of games won is problematic (as was discovered during the Tron Google AI challenge) and something like TrueSkill or bayeselo would be more robust.
[+] matthiaskramm|13 years ago|reply
Yeah, we're seeing the first problems with that right now: newly uploaded bots progress faster than they should because they play against a more "diluted" pool of opponents than the old-timers did. We'll think of something.
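The win-percentage issue a1k0n raises shows up even against the simplest rating-based alternative, an Elo-style update (TrueSkill and bayeselo refine the same idea). This is an illustrative sketch, not what Scribd runs:

```javascript
// Elo-style rating update: each result moves both players' ratings
// toward what the outcome implies, weighted by how surprising it was.
const K = 32; // step size; 32 is a common default

// Expected score of A against B, given their current ratings.
function expectedScore(ratingA, ratingB) {
  return 1 / (1 + Math.pow(10, (ratingB - ratingA) / 400));
}

// score: 1 if A won, 0.5 for a draw, 0 if A lost.
function updateRatings(ratingA, ratingB, score) {
  const delta = K * (score - expectedScore(ratingA, ratingB));
  return [ratingA + delta, ratingB - delta];
}
```

A new bot starting at the pool average moves quickly after upset wins, which handles the "diluted pool" problem better than a raw win percentage does.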
[+] maayank|13 years ago|reply
The more I think about it, the more I like it as a recruiting venue[1]:

* A good way to measure a potential recruit's skills

* Gives a fun experience to go over later at the interview, both verifying that it was the recruit who coded it and establishing common ground beforehand

* You reach people who have already shown some interest in your company by finding the contest

* You get people who are interested in hacking around puzzles

[1] (I don't know if scribd sees it that way)

[+] backspace|13 years ago|reply
Adding to what you said, someone who discovers, attempts and solves this challenge is already better than 70% of the programmers out there. They show promise of being motivated to crack a puzzle that robs them of their sleep, and of taking the time to learn and code against the Scribd API.

Even if they don't "win", they are far superior to a typical resume pusher.

[+] madmike|13 years ago|reply
Absolutely. We've already used this game for some talking points during interviews.
[+] cfqycwz|13 years ago|reply
Smart that they implemented a 10-second time limit, because my first thought was to write a bot that would pass the current game state to Mechanical Turk and have a person solve it. It would be interesting to see how a human fared against Scribd's developers' bots.
[+] tjsnyder|13 years ago|reply
I'm pretty sure I was the guinea pig for this system and I have to say, it was one of the more enjoyable interview tasks I have had to perform.

There is just something extremely satisfying about watching your little robot run across a field collecting little fruit.

[+] rorrr|13 years ago|reply
If we started asking this as an interview question, NOBODY would pass. Time after time we see "senior" developers who fail to write very simple functions, like counting the number of occurrences of each letter in a string.

Personally, it sounds like an awesome question to me, but it's not clear how you would judge it. Have interviewees' bots fight each other?
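The warm-up question rorrr mentions is small enough to sketch in JavaScript (the language the contest bots are written in); a minimal version:

```javascript
// Count the occurrences of each character in a string.
function letterCounts(s) {
  const counts = {};
  for (const ch of s) {
    counts[ch] = (counts[ch] || 0) + 1;
  }
  return counts;
}

// letterCounts("banana") → { b: 1, a: 3, n: 2 }
```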

[+] gridspy|13 years ago|reply
...Must keep working on Gridspy.... Must not enter fun contest...

twitch

[+] unknown|13 years ago|reply

[deleted]

[+] MichaelGlass|13 years ago|reply
there's a tiresome amount of dick referencing in the programming world.

(fyi, one half of the team who built the game does not possess a dick)

[+] cluda01|13 years ago|reply
Comparing yourself to others is a good way to gain perspective on how far you've come and, more importantly, how much further you have to go to attain some desired level of skill. Taken to extremes, though, it predictably ends up hurting more than helping.
[+] sp332|13 years ago|reply
How could this be framed better? It's already pretty friendly.
[+] lacksconfidence|13 years ago|reply
For people that haven't studied algorithms much, where is a good place to start on game-playing algorithms? Pointers to papers, or names of data structures to look up? Perhaps http://erikdemaine.org/papers/AlgGameTheory_GONC3/paper.pdf ?
[+] tikhonj|13 years ago|reply
You should look at the basics of AI and various search algorithms.

The first priority would be to quickly pick up the vocabulary: what is a state, what is a utility function and so on.

I recently took a class on AI and we used the (rather popular) Artificial Intelligence: A Modern Approach by Russell and Norvig [1]. I didn't actually read the book, but I've heard good things about it, so it's definitely worth a look.

[1]: http://aima.cs.berkeley.edu/

All the lectures for the course are available online as well[2]. The professor my semester (Dan Klein) was a brilliant lecturer, so the lectures are worth watching if you have the time. The lecture notes[3] are also online.

[2]: http://webcast.berkeley.edu/playlist#c,d,Computer_Science,9C...

[3]: http://inst.eecs.berkeley.edu/~cs188/fa11/lectures.html

Of course, if you want to participate in this game, you are doubtless in a hurry. So you might want to skim through lectures 2, 3, 6, 7, 10, 11 in roughly that order--they seem to be the most pertinent.

[+] Jach|13 years ago|reply
Combinatorial game theory would be the wrong tool for this, I believe, since this isn't a strictly finite turn-based combinatorial game. (And in my opinion combinatorial game theory is more useful for analysis and proof techniques than for creating AI algorithms, but I've only had one class on it.) The "fruitwalk-ab" bot mentioned in the blog uses minimax (with the alpha-beta optimization, of course), which is the bread-and-butter algorithm for these kinds of games, and I expected it to be in 1st place. Sure enough, it is at the moment. (Edit2: No longer.)

In an intro machine learning course you'd learn about minimax and others, but skip paying any money and just read the basic algorithms here and look up the wiki pages for even more examples: http://www.stanford.edu/~msirota/soco/blind.html (The introduction has some term definitions.)

Edit: Also, the obligatory plug for Artificial Intelligence: A Modern Approach http://www.amazon.com/Artificial-Intelligence-Modern-Approac...
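For reference, the bread-and-butter minimax-with-alpha-beta that Jach describes looks roughly like this. The `game` interface here (moves, apply, isTerminal, evaluate) is a hypothetical abstraction for illustration, not the contest API:

```javascript
// Minimax with alpha-beta pruning over a generic game interface.
// Scores are from the maximizing player's point of view.
function alphabeta(game, state, depth, alpha, beta, maximizing) {
  if (depth === 0 || game.isTerminal(state)) {
    return game.evaluate(state);
  }
  if (maximizing) {
    let best = -Infinity;
    for (const move of game.moves(state)) {
      best = Math.max(best, alphabeta(game, game.apply(state, move),
                                      depth - 1, alpha, beta, false));
      alpha = Math.max(alpha, best);
      if (alpha >= beta) break; // beta cutoff: opponent avoids this branch
    }
    return best;
  } else {
    let best = Infinity;
    for (const move of game.moves(state)) {
      best = Math.min(best, alphabeta(game, game.apply(state, move),
                                      depth - 1, alpha, beta, true));
      beta = Math.min(beta, best);
      if (alpha >= beta) break; // alpha cutoff
    }
    return best;
  }
}
```

The pruning never changes the returned value; it only skips branches that provably can't affect the result, which is what makes a deeper search fit inside the 10-second move limit.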

[+] Trufa|13 years ago|reply
Hey, honest question here: is it possible to brute-force this, considering (almost) every possible move, and get to a position where you can't lose? Given that the board is restricted to 15x15 and the fruits to 81... I'm not saying this is easy; I'm asking if the game AI has an upper limit, so to speak...
[+] egowaffle25|13 years ago|reply
The instructions say you need to make a move in 10 seconds, but that's the only ceiling I can imagine for this approach.
[+] rorrr|13 years ago|reply
Search space = 4^(15x15) = 2.9 × 10^135

Impossible.
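The figure checks out; since 4^225 overflows a double, a quick sanity check works in log space (treating the state space as 225 independent 4-way choices is rorrr's framing, not an exact game-tree count):

```javascript
// 4^225 is far beyond a double's range, so compute it via log10.
const log10Size = 225 * Math.log10(4);               // ≈ 135.46
const exponent = Math.floor(log10Size);              // 135
const mantissa = Math.pow(10, log10Size - exponent); // ≈ 2.9
// So 4^(15*15) ≈ 2.9 × 10^135.
```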

[+] bravura|13 years ago|reply
Given that each move must be made in 10 seconds, here's my approach. I'm proposing that you use machine learning to evaluate moves, to approximate the exact (search-based) scoring. I'm not implementing this, but feel free to ask me for more details:

Training phase:

Create a random board configuration. Exhaustively explore the search space to determine whether it is a win (+1), a loss (-1), or a draw (0) for Player 1. You now have a training example: (board configuration, game outcome).

Now, train a neural network (or other non-linear ML model, e.g. SVM) to predict the expected outcome based upon the board configuration.

Deployment phase:

Port the neural network to JavaScript. For each possible move, use the network to predict the outcome of that move, and pick the move with the highest expected outcome. The neural network runs in constant time, most likely well under 10 seconds per move.
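A minimal sketch of that deployment phase, assuming a one-hidden-layer network; the weights, the feature encoding, and every name here are placeholders for illustration, not a real trained bot:

```javascript
function relu(x) { return Math.max(0, x); }

// One hidden ReLU layer, then tanh to squash the score into (-1, 1),
// matching the -1 / 0 / +1 outcome labels from the training phase.
function evaluate(features, { w1, b1, w2, b2 }) {
  const hidden = w1.map((row, i) =>
    relu(row.reduce((sum, w, j) => sum + w * features[j], b1[i])));
  return Math.tanh(hidden.reduce((sum, h, i) => sum + w2[i] * h, b2));
}

// encode(move) turns the board-after-move into a feature vector;
// pick the move whose predicted outcome is best.
function bestMove(moves, encode, params) {
  let best = null;
  let bestScore = -Infinity;
  for (const move of moves) {
    const score = evaluate(encode(move), params);
    if (score > bestScore) {
      bestScore = score;
      best = move;
    }
  }
  return best;
}
```

With at most four legal moves per turn, this is a handful of matrix multiplies per move, so the 10-second limit is no constraint at all.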

[+] alxv|13 years ago|reply
> Create a random board configuration. Exhaustively explore the search space to find whether this is a win +1, lose -1, or draw 0 for Player 1

Given that the search space can grow as O(4^(mn)), this can be done only for endgame configurations. Further, not knowing any bounds on the board size makes the input representation difficult to define for such a machine learning approach. And your target should probably be the weights of an evaluation function, rather than the exact game outcome.

As for the learning algorithm, I know TD-learning was found to be a good approach in various chess programs.

> For each possible move, use the neural network to predict the outcome of that move. Pick the move with highest expected outcome.

You would likely still want to run an alpha-beta search on top of the network's predictions to pick the move, to reduce the impact of prediction error.
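The TD-learning idea alxv mentions boils down to a small update rule; here's a sketch for a linear evaluation function, with all names illustrative:

```javascript
// TD(0) update for a linear evaluation function V(s) = w · features(s).
// After observing a transition s -> s' with reward r, nudge the weights
// so V(s) moves toward r + gamma * V(s').
function tdUpdate(weights, features, nextValue, reward, gamma, alpha) {
  const value = weights.reduce((sum, w, i) => sum + w * features[i], 0);
  const tdError = reward + gamma * nextValue - value;
  return weights.map((w, i) => w + alpha * tdError * features[i]);
}
```

Repeated over self-play games, this trains the evaluation function directly from outcomes instead of needing exhaustively solved positions.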

[+] epaik|13 years ago|reply
Hey, I'm writing a bot for this.

One thing though: I think resetting the board state should call new_game() again. This likely only matters when testing your bot, but it's nice for variables that need to be initialized once per game.

It's pretty fun! My current greedy solution is called: SoGreedy. :)

[+] matthiaskramm|13 years ago|reply
Fixed, thanks!

Your bot is doing pretty well already. :)

[+] michaelvillar|13 years ago|reply
[Sorry for the bug report here; I didn't find another way.] I think there is a bug with get_my_item_count() and get_opponent_item_count(): these two functions don't return decimals. When a player has 0.5, they return 0. It works correctly locally, though.

Example of a failed match: http://www.scribd.com/job_game/match/295703. Take a line like 1 / 1 / 0 / 0 / 1: the first number is the fruit type, the second the total fruits, the third my fruits, the fourth the opponent's fruits, and the fifth is irrelevant. (You'll never see 0.5 in the third or fourth number.)

[+] michaelvillar|13 years ago|reply
I think there is some kind of bug where a bot stops making moves. Example: http://www.scribd.com/job_game/match/275573. I simulated two boards where this bug happens online, and it doesn't happen locally.

Also, the API docs are wrong here: "return the index of that fruit (starting with 0)". It actually starts at 1.

[+] matthiaskramm|13 years ago|reply
Thanks! We'll fix the API docs.

About the bug: try to trace() out a few messages in your make_move() function; that might tell you whether something went wrong. (The logs will appear at the bottom of the game replays.)

[+] danso|13 years ago|reply
I was going to post that they should do a clearer job of describing the challenge and the main rules (e.g. using bullet points and headers), but I guess there's a separate webpage devoted to the challenge:

http://www.scribd.com/jobs/botrace_rules

[+] evanlong|13 years ago|reply
Scribd has Kramm. You have already lost.
[+] davidjhamp|13 years ago|reply
I think they need to reevaluate how they score a tie: out of 58 games I have 4 losses and 30 wins, and the rest are ties, but my percentage of games won is only around 50%. It seems like a tie shouldn't be put in the same category as a loss.