liseen/main-content-extractor
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|
Repository files navigation
Description
A main content extractor for web page
extract title, main content, author and publish time
Limit
only web page used Chinese language supported
BUILD
ant -f build/build.xml