erickj | 1 year ago

If I were going to build this prototype, I'd start with just a semi-structured textual play-by-play recap as the input. Including roster, injury, and schedule information with a fairly basic prompt would probably also go a long way.

This data exists for most live games at this point via various web services. I'm sure ESPN has significant internal resources to source that info.
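
A rough sketch of what that input assembly might look like, assuming the play-by-play, roster, injury, and schedule data have already been fetched as plain strings. The `GameContext` class, the `build_recap_prompt` function, and the prompt wording are all hypothetical names made up for illustration, not part of any real service:

```python
from dataclasses import dataclass, field


@dataclass
class GameContext:
    """Hypothetical container for the text inputs to a recap prompt."""
    plays: list[str]
    roster: list[str] = field(default_factory=list)
    injuries: list[str] = field(default_factory=list)
    schedule: list[str] = field(default_factory=list)


def _section(title: str, items: list[str]) -> str:
    # Render one labeled bullet list; mark empty sections explicitly so the
    # model is not left guessing whether data was omitted.
    body = "\n".join(f"- {item}" for item in items) if items else "- (none provided)"
    return f"{title}:\n{body}"


def build_recap_prompt(ctx: GameContext) -> str:
    # A deliberately basic prompt: one instruction plus the raw context.
    parts = [
        "Write a short recap of the game described below. "
        "Use only the facts provided.",
        _section("Play by play", ctx.plays),
        _section("Roster", ctx.roster),
        _section("Injuries", ctx.injuries),
        _section("Schedule", ctx.schedule),
    ]
    return "\n\n".join(parts)
```

The resulting string would then be sent to whatever model is being used; that call is left out here because it depends entirely on the provider.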

skipants|1 year ago

I don't think ESPN does anything that takes significant resources. That's all handled by Sportradar or ... there's another big provider but their name eludes me. They basically firehose you all the game information as structured data and you can use it programmatically however you'd like.
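
As a minimal sketch of using such a feed programmatically: assuming events arrive as one JSON object per line with `period`, `clock`, and `description` fields (a made-up schema for illustration, not any provider's actual format), they could be flattened into play-by-play text like this:

```python
import json


def event_to_line(event: dict) -> str:
    # Field names here are assumptions, not a real provider's schema.
    period = event.get("period", "?")
    clock = event.get("clock", "??:??")
    description = event.get("description", "(no description)")
    return f"P{period} {clock} {description}"


def feed_to_play_by_play(raw_lines: list[str]) -> list[str]:
    # Convert newline-delimited JSON events into readable text lines,
    # skipping blank lines that often appear in streamed feeds.
    lines = []
    for raw in raw_lines:
        raw = raw.strip()
        if not raw:
            continue
        lines.append(event_to_line(json.loads(raw)))
    return lines
```

A real feed would also need authentication, reconnect handling, and error handling for malformed events, none of which is shown here.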

mh-|1 year ago

I assume this is what lets baseball games show obscure factoids like "3rd in the NL West when facing left-handed pitchers on Tuesday"?

mason55|1 year ago

Yeah, it feels like the ideal way is to feed in a transcript of the announcer audio + some standard stats. That would ensure you catch both the human stories & the factual content.

But I wonder if there are licensing issues with using the audio/transcript to generate your summary. I know that the raw stats are public domain but I wouldn't be surprised if they can't use the transcripts or audio.