Skip to content

FeatureRequest: Migrate to a faster xml parser #10

@groceryheist

Description

@groceryheist

Others have reported improved performance when using expat to parse Wikimedia dumps. We are currently using ElementTree which provides a good balance between usability and speed.

There is probably potential to speed up this library by switching to a faster xml parser. Candidates include:

  • lxml
  • cElementTree
  • expat

Migrating to lxml or cElementTree might be relatively easy because they have similar APIs to ElementTree.

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions