(no title)
Kelamir | 2 years ago
Hmm good insight there. I've done some experimenting formerly by chunk length and it's been pretty troublesome due to missing context.
Kelamir | 2 years ago
Hmm good insight there. I've done some experimenting formerly by chunk length and it's been pretty troublesome due to missing context.
gwern|2 years ago
chaxor|2 years ago
If you use local models then it's a fantastic idea.
screye|2 years ago
Define a custom recursive text splitter in langchain, and do chunking heuristically. It works a lot better.
That being said, it is useful to maintain some global and local context. But, I wouldn't use overlapping windows.
SmooL|2 years ago
crucialfelix|2 years ago
https://unstructured-io.github.io/unstructured/bricks.html#p...