A hybrid approach for content extraction with text density and visual importance of DOM nodes

Dandan Song, Fei Sun, Lejian Liao. A hybrid approach for content extraction with text density and visual importance of DOM nodes. Knowl. Inf. Syst., 42(1):75-96, 2015. [doi]

Authors

Dandan Song

This author has not been identified. Look up 'Dandan Song' in Google

Fei Sun

This author has not been identified. Look up 'Fei Sun' in Google

Lejian Liao

This author has not been identified. Look up 'Lejian Liao' in Google