Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Convert MediaWiki content into raw text documents #175

Closed
6 tasks
renoirb opened this issue Jul 2, 2015 · 3 comments
Closed
6 tasks

Convert MediaWiki content into raw text documents #175

renoirb opened this issue Jul 2, 2015 · 3 comments

Comments

@renoirb
Copy link
Member

renoirb commented Jul 2, 2015

In order to allow the docs page to use git as edition system, we need to export the full content into raw text files.

High level description

  • Export all content into simple as possible markdown
  • Strip off any special feature, but keep raw content

Estimated tasks

  • Find a way to grab content from MediaWiki and be able to work with content to format how we want to export it
  • Set in place filter to strip Extension output that we won't need; e.g. splist, ...(?)
  • Crawler that lists all pages
  • Raw file dumper with path matching the url scheme, default is english (e.g. /css/properties/border-radius could write file in path css/properties/border-radius/en.md)
  • If url ends by two letter code (e.g. zh, fr), make the file name to be called the slash before, inside the appropriate folder, e.g. /html/ja into html/ja.md
  • Nice to have Find way to get all contributors from API, but don't make it part of import just yet.
@renoirb
Copy link
Member Author

renoirb commented Jul 2, 2015

Victory!!

Could find a way to leverage MediaWiki Parsoid and add a new output manager from HTML to Markdown.

Convert MediaWiki markup into Markdown preview

@renoirb
Copy link
Member Author

renoirb commented Aug 4, 2015

Duplicate of #177 closing this one.

@renoirb
Copy link
Member Author

renoirb commented Aug 4, 2015

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant