- Keep parsed line info in blocks - Keep span/ char info in the Line - Extract char bbox information from OCR on mac