Abstract: We present an efficient algorithm that automatically reformats text scattered across multiple nodes in the DOM tree of an HTML file converted from a PDF document. Reformatting ...