(no title)
jbondeson | 9 years ago
On 3D deformation, you're officially in academic research land. Nearly all algorithms require you to have a solid guess as to what the aspect ratio of the target object is. Other algorithms use heuristics based upon what you expect to find on a page. One particularly fun algorithm used the baseline of text (I believe for that paper it was Arabic) and fit a high-order curve to it which was then reversed. Unfortunately I haven't seen a truly generic approach that doesn't require a implementation-specific input.
[1] Frankly my feeling is that RGB to grayscale is a mistake and holding back many of these algorithms
yxiongdropbox|9 years ago
Agree with that 3D deformation is a difficult open problem, and we haven't gotten into that yet. Currently we assumed the document is a flat rectangle, which maps to a quadrilateral in image space. A homography is then applied to rectify it, and it seems to work quite well if the paper is slightly curved or folded.
jbondeson|9 years ago
Great work, and I look forward to seeing future posts on the solutions you've been able to come up with!
prashnts|9 years ago
Thanks for this additional information, much appreciated!