/**
- Christopher Yeh
- [email protected]
- Q&A */
-
Suppose we are using a method of encryption that does as much as possible to hide any patterns, including character frequencies. Do you believe there is any advantage (in terms of resulting file size) to compressing a file before encrypting it, as opposed to compressing it after encrypting it? In one sentence, say why or why not in the form of a hypothesis.
It should be better to compress before encrypting: strong encryption makes the output look like uniformly random bytes, with more distinct byte values and nearly flat frequencies, so there is little redundancy left for Huffman coding to exploit and the Huffman trie itself gets larger.
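
To build intuition for this hypothesis, here is a quick Python sketch (not part of the assignment) that compares the byte statistics of the plain text with an equal-length block of random bytes, which stands in for well-encrypted output; warandpeace.txt is the test file from the next question.

    import collections
    import os

    # Read the plain text; random bytes stand in for ciphertext, since a strong
    # cipher's output is statistically close to uniform random bytes.
    with open("warandpeace.txt", "rb") as f:
        text = f.read()
    fake_ciphertext = os.urandom(len(text))

    for label, data in (("plain text", text),
                        ("random stand-in ciphertext", fake_ciphertext)):
        counts = collections.Counter(data)
        top_share = counts.most_common(1)[0][1] / len(data)
        print(f"{label}: {len(counts)} distinct byte values, "
              f"most common byte is {top_share:.1%} of the file")

The plain text uses far fewer than 256 distinct byte values, heavily skewed toward a few common ones, while the random stand-in uses essentially all 256 nearly uniformly, so Huffman coding has far less to work with after encryption.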
-
Test your hypothesis on the command line using warandpeace.txt, your compression program, and the command gpg --batch -c -z0 --passphrase . The -z0 flag is very important because it prevents GPG from doing its own compression. You can read more about GPG at https://www.gnupg.org/. You may use the reference compression program if there is a problem with yours.
What file size results from compressing before encrypting with gpg?
1,870,825 bytes.
What file size results from compressing after encrypting with gpg?
3,289,150 bytes.
Do these results agree with your hypothesis?
Yes; compressing before encrypting produced a much smaller file (1,870,825 bytes vs. 3,289,150 bytes), as hypothesized.
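
The same ordering effect can be reproduced entirely in Python with stand-ins (a rough sketch: zlib replaces the Huffman compressor and a one-time-pad XOR replaces gpg's cipher, since both produce random-looking output):

    import os
    import zlib

    with open("warandpeace.txt", "rb") as f:
        data = f.read()

    # One-time-pad XOR as a stand-in cipher; zip() truncates the pad as needed.
    pad = os.urandom(len(data))
    def encrypt(buf: bytes) -> bytes:
        return bytes(b ^ k for b, k in zip(buf, pad))

    compress_then_encrypt = encrypt(zlib.compress(data))
    encrypt_then_compress = zlib.compress(encrypt(data))

    print("compress, then encrypt:", len(compress_then_encrypt), "bytes")
    print("encrypt, then compress:", len(encrypt_then_compress), "bytes")

Compressing first keeps the size near that of the compressed text; compressing the random-looking ciphertext gains essentially nothing.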
-
Suppose instead of GPG we encrypted by XORing each byte in the file against a single shared one byte key. This is effectively a byte substitution cypher. Would this compress better, worse, or the same as a file encrypted with GPG? In one sentence, why?
It would compress better than the GPG-encrypted file, because XORing every byte with the same one-byte key is just a substitution: the encrypted file has the same number of distinct byte values, with the same frequencies, as the original file, only relabeled.
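
A minimal sketch of this single-byte XOR scheme (the key 0x5A is arbitrary) shows that it only relabels byte values, leaving the number of distinct values and the multiset of frequencies untouched:

    import collections

    def xor_encrypt(data: bytes, key: int) -> bytes:
        # XOR every byte with the same one-byte key: a byte substitution cipher.
        return bytes(b ^ key for b in data)

    with open("warandpeace.txt", "rb") as f:
        original = f.read()
    encrypted = xor_encrypt(original, 0x5A)

    orig_counts = collections.Counter(original)
    enc_counts = collections.Counter(encrypted)

    assert len(orig_counts) == len(enc_counts)                          # same alphabet size
    assert sorted(orig_counts.values()) == sorted(enc_counts.values())  # same frequencies, relabeled
    print("distinct byte values in both files:", len(orig_counts))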
-
Would it compress better, worse, or the same as the original unencrypted file? In one sentence, why?
It would compress to the same size as the original file, because the frequency distribution is merely relabeled, so the Huffman code lengths, and therefore the encoded size, are identical.
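
To check the size claim, the sketch below builds a toy Huffman code with heapq (not the course's implementation) and compares the total coded length of the original and the XOR'd file; because the frequency multisets are identical, the totals come out equal.

    import collections
    import heapq

    def huffman_encoded_bits(data: bytes) -> int:
        """Total bits to Huffman-code data, ignoring the stored trie."""
        freqs = collections.Counter(data)
        # Heap entries: (weight, tiebreaker, {symbol: code length so far}).
        heap = [(w, i, {sym: 0}) for i, (sym, w) in enumerate(freqs.items())]
        heapq.heapify(heap)
        tiebreak = len(heap)
        while len(heap) > 1:
            w1, _, d1 = heapq.heappop(heap)
            w2, _, d2 = heapq.heappop(heap)
            merged = {s: depth + 1 for s, depth in {**d1, **d2}.items()}
            heapq.heappush(heap, (w1 + w2, tiebreak, merged))
            tiebreak += 1
        lengths = heap[0][2]
        return sum(freqs[s] * depth for s, depth in lengths.items())

    with open("warandpeace.txt", "rb") as f:
        original = f.read()
    encrypted = bytes(b ^ 0x5A for b in original)   # same arbitrary key as above

    print("Huffman-coded size, original:", huffman_encoded_bits(original) // 8, "bytes")
    print("Huffman-coded size, XOR'd:   ", huffman_encoded_bits(encrypted) // 8, "bytes")

The stored trie costs the same in both cases too, since both files have the same number of distinct byte values.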
-
Would the Shannon entropy be higher, lower, or the same as the original unencrypted file? In one sentence, why?
It would be the same, because XOR with a fixed one-byte key is a one-to-one mapping on byte values, so each byte value in the encrypted file occurs exactly as often as its preimage in the original, and Shannon entropy depends only on that frequency distribution.
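
The permutation argument can be checked numerically; here is a quick sketch computing the Shannon entropy, in bits per byte, of the original file and its single-byte-XOR version:

    import collections
    import math

    def shannon_entropy(data: bytes) -> float:
        """Shannon entropy in bits per byte, from byte frequencies."""
        counts = collections.Counter(data)
        n = len(data)
        return -sum((c / n) * math.log2(c / n) for c in counts.values())

    with open("warandpeace.txt", "rb") as f:
        original = f.read()
    encrypted = bytes(b ^ 0x5A for b in original)   # same arbitrary one-byte key

    print("original :", shannon_entropy(original))
    print("encrypted:", shannon_entropy(encrypted))

Both print the same value, since the sums run over the same multiset of frequencies.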