C:\Users\Diego\Desktop\TEST> python aaa.py ocwikibooks-20140928-pages-articles.xml.bz2
Traceback (most recent call last):
File "c:\Program Files\Python32\lib\pickle.py", line 683, in save_global
klass = getattr(mod, name)
AttributeError: 'module' object has no attribute 'dec'
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "aaa.py", line 18, in <module>
run([sys.argv[1]])
File "aaa.py", line 13, in run
for title in xml_dump.map(dump, rev_info):
File "c:\Program Files\Python32\lib\site-packages\mediawiki_utilities-0.4.1-py3.2.egg\mw\xml_dump\map.py", line 72, in map
processor.start()
File "c:\Program Files\Python32\lib\multiprocessing\process.py", line 132, in start
self._popen = Popen(self)
File "c:\Program Files\Python32\lib\multiprocessing\forking.py", line 266, in __init__
dump(process_obj, to_child, HIGHEST_PROTOCOL)
File "c:\Program Files\Python32\lib\multiprocessing\forking.py", line 188, in dump
ForkingPickler(file, protocol).dump(obj)
File "c:\Program Files\Python32\lib\pickle.py", line 237, in dump
self.save(obj)
File "c:\Program Files\Python32\lib\pickle.py", line 344, in save
self.save_reduce(obj=obj, *rv)
File "c:\Program Files\Python32\lib\pickle.py", line 432, in save_reduce
save(state)
File "c:\Program Files\Python32\lib\pickle.py", line 299, in save
f(self, obj) # Call unbound method with explicit self
File "c:\Program Files\Python32\lib\pickle.py", line 627, in save_dict
self._batch_setitems(obj.items())
File "c:\Program Files\Python32\lib\pickle.py", line 660, in _batch_setitems
save(v)
File "c:\Program Files\Python32\lib\pickle.py", line 299, in save
f(self, obj) # Call unbound method with explicit self
File "c:\Program Files\Python32\lib\pickle.py", line 687, in save_global
(obj, module, name))
_pickle.PicklingError: Can't pickle <function dec at 0x0000000002812F48>: it's not found as mw.xml_dump.map.dec
C:\Users\Diego\Desktop\TEST>Traceback (most recent call last):
File "<string>", line 1, in <module>
File "c:\Program Files\Python32\lib\multiprocessing\forking.py", line 369, in main
self = load(from_parent)
EOFError
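The traceback above shows `multiprocessing` on Windows pickling the process object before handing it to the child (`forking.py`, `dump(process_obj, to_child, HIGHEST_PROTOCOL)`). Pickle stores functions by name, so a function that cannot be looked up as an attribute of its module cannot be sent. The message "it's not found as mw.xml_dump.map.dec" is consistent with `dec` being defined inside another function, though that is an assumption about the library's source. A minimal sketch of the same failure, using hypothetical names (`make_nested`, `can_pickle`) that are not part of `mw`:

```python
import pickle


def module_level(x):
    """A top-level function: pickle stores it by its importable name."""
    return x * 2


def make_nested():
    """Return a function defined inside another function."""
    def dec(x):
        return x * 2
    return dec


def can_pickle(obj):
    """Return True if pickle.dumps succeeds on obj, False otherwise."""
    try:
        pickle.dumps(obj)
        return True
    except (pickle.PicklingError, AttributeError):
        # Depending on the Python version, a nested function raises
        # either PicklingError or AttributeError.
        return False


if __name__ == "__main__":
    print("module-level function picklable:", can_pickle(module_level))
    print("nested function picklable:", can_pickle(make_nested()))
```

On platforms that fork, the child inherits the parent's memory and this pickling step is skipped, which would explain why the problem only shows up on Windows.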
I spent some time working on this today. Unfortunately, 7z support in Python libraries is poor. pylzma includes a module, py7zlib, that can examine a 7z archive. I've been trying to use it to decompress an XML dump, but it looks like it can't handle the massive file size (max 2GB).
Traceback (most recent call last):
File "", line 4, in
File "/home/halfak/.pyenv/versions/2.7/lib/python2.7/site-packages/py7zlib.py", line 576, in read
data = getattr(self, decoder)(coder, data)
File "/home/halfak/.pyenv/versions/2.7/lib/python2.7/site-packages/py7zlib.py", line 632, in _read_lzma
dec = pylzma.decompressobj(maxlength=self._start+self.size)
OverflowError: signed integer is greater than maximum
This happens in both Python 2 and Python 3. There are some suggestions that you manage the file length yourself, e.g. fancycode/pylzma#3.
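As a rough illustration of what "managing the length yourself" can look like, here is a sketch that decompresses a stream in bounded pieces instead of asking for the whole output at once. It uses the standard-library `lzma` module (Python 3.5+ for `needs_input`), which reads `.xz`/`.lzma` streams and not `.7z` archives, so it is a stand-in for the idea rather than a fix for py7zlib. The function name `decompress_in_chunks` and the chunk size are assumptions.

```python
import io
import lzma


def decompress_in_chunks(fileobj, chunk_size=64 * 1024):
    """Yield decompressed pieces of at most chunk_size bytes each."""
    decompressor = lzma.LZMADecompressor()
    while not decompressor.eof:
        if decompressor.needs_input:
            data = fileobj.read(chunk_size)
            if not data:
                break  # input ended before the end of the stream
        else:
            # Buffered input can still produce output; feed nothing new.
            data = b""
        piece = decompressor.decompress(data, max_length=chunk_size)
        if piece:
            yield piece


if __name__ == "__main__":
    payload = b"<page>example</page>\n" * 50000
    compressed = io.BytesIO(lzma.compress(payload))
    total = sum(len(p) for p in decompress_in_chunks(compressed))
    print(total == len(payload))
```

Because no single call ever requests more than `chunk_size` bytes, the total size of the decompressed dump never has to fit in a signed 32-bit integer.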
When running this on Windows, the result is the following: