top | item 44228962 (no title) olavfosse | 8 months ago Very cool!What's the story for chunking PDFs?We've been using Marker and handling markdown->chunks manually. discuss order hn newest snyy|8 months ago Pretty much what you described. Convert the PDF to Markdown, join content across pages so that its all one string, then chunk it. Our evals show this approach works best.
snyy|8 months ago Pretty much what you described. Convert the PDF to Markdown, join content across pages so that its all one string, then chunk it. Our evals show this approach works best.
snyy|8 months ago