Changes Apache Commons Developers Base32.HEX_DECODE_TABLE contains the wrong value 32 Charsets Javadoc breaks build when using Java 8 Bug in HW rule in Soundex Javadoc for SHA-224 DigestUtils methods should mention Java 1.8.0 restriction instead of 1.4.0. Don't deprecate Charsets Charset constants in favor of Java 7's java.nio.charset.StandardCharsets Refactor HmacUtils methods into the HmacAlgorithms enum Add HmacAlgorithms.HMAC_SHA_224 (Java 8 only) Support JEP 287: SHA-3 Hash Algorithms Create a minimal Digest command line utility: org.apache.commons.codec.digest.Digest Add DigestUtils.getDigest(String, MessageDigest) Make some DigestUtils APIs public Add java.io.File APIs to MessageDigestAlgorithm BaseNCodecOutputStream only supports writing EOF on close() Support SHA-224 in DigestUtils on Java 8 Support java.nio.ByteBuffer in org.apache.commons.codec.binary.Hex Support java.nio.ByteBuffer in DigestUtils Add BaseNCodec.encode(byte[], int, int) input with offset and length parameters for Base64 and Base32. Add convenience method decodeHex(String). Add faster CRC32 implementation. Add Daitch-Mokotoff Soundex QuotedPrintableCodec does not support soft line break per the 'quoted-printable' example on Wikipedia Added clarification to Javadoc of Base64 concerning the use of the urlSafe parameter Added clarification to the Javadoc of Base[32|64]OutputStream that it is mandatory to call close() Add support for HMAC Message Authentication Code (MAC) digests Beider Morse Phonetic Matching producing incorrect tokens NullPointerException in DoubleMetaPhone.isDoubleMetaphoneEqual when using empty strings Make possible to provide padding byte to BaseNCodec in constructor Fix Javadoc 1.8.0 errors Deprecate Charsets Charset constants in favor of Java 7's java.nio.charset.StandardCharsets Fix Java 8 build Javadoc errors Update from commons-parent 34 to 35 Improve performance of Beider Morse encoder Beider Morse does not close Scanners used to read config files Base32 decode table has spurious value Link broken in Metaphone Javadoc Spelling fixes in Javadoc and comments Add DigestUtils.updateDigest(MessageDigest, InputStream). Add JUnit to test our decode with pad character in the middle. Add Match Rating Approach (MRA) phonetic algorithm encoder. ColognePhonetic encoder unnecessarily creates many char arrays on every loop run. Base64.encodeBase64URLSafeString doesn't add padding characters at the end. DigestUtils: Add MD2 APIs. DigestUtils: add APIs named after standard algorithm name SHA-1. DigestUtils.getDigest(String) should throw IllegalArgumentException instead of RuntimeException. Create a class MessageDigestAlgorithms to define standard algorithm names. DigestUtils.getDigest(String) loses the original exception. Remove unnecessary attempt to fill up the salt variable in UnixCrypt. Remove unnecessary call to Math.abs(). More tests and minor things. Added regression tests for PhoneticEngine based on Solr-3.6.0. BeiderMorseEncoder/PhoneticEngine: make results deterministic by using a LinkedHashSet instead of a HashSet. StringBuffer could be replaced by StringBuilder for local variables. DigestUtils: add updateDigest methods and make methods public. Add classes for MD5/SHA1/SHA-512-based Unix crypt(3) hash variants. Base64 encode() method is no longer thread-safe, breaking clients using it as a shared BinaryEncoder. Note: the fix breaks binary compatibility, however the changes are to a class (BaseNCodec) which is intended for internal use. Complete FilterInputStream interface for BaseNCodecInputStream. Use Charset objects when possible, create Charsets for required character encodings. BeiderMorseEncoder OOM issues. DoubleMetaphone Javadoc contains dead links. Base64InputStream.skip skips underlying stream, not output. Implement NYSIIS phonetic encoder. Use standard Maven directory layout. Documentation spelling fixes. Fix various character encoding issues in comments and test cases. Implement a Beider-Morse phonetic matching codec. Migrate to Java 5. Migrate to JUnit 4. ColognePhonetic Javadoc should use HTML entities for special characters. Added new Base32 encoder. new Base64().encode() appends a CRLF, and chunks results into 76 character lines. Many test cases use getBytes() which uses the default platform encoding so tests may fail on some platforms. Add test(s) to check that encodeBase64() does not chunk output. Base64 default constructor behaviour changed to enable chunking in 1.4. Base64.encodeBase64String() shouldn't chunk. Base64InputStream#read(byte[]) incorrectly returns 0 at end of any stream which is multiple of 3 bytes long. Typo in DecoderException message thrown from Hex.decodeHex. ArrayIndexOutOfBoundsException when doing multiple reads() on encoding Base64InputStream. Add the "Kölner Phonetik" encoder (Cologne Phonetic) to codec.lang. org.apache.commons.codec.net.URLCodec.ESCAPE_CHAR isn't final but should be. Base64.encodeBase64(byte[] binaryData, boolean isChunked, boolean urlSafe, int maxResultSize) throws IAE for valid maxResultSize if isChunked is false. org.apache.commons.codec.language.RefinedSoundex.US_ENGLISH_MAPPING should be package protected MALICIOUS_CODE. org.apache.commons.codec.language.Soundex.US_ENGLISH_MAPPING should be package protected MALICIOUS_CODE. DoubleMetaphone.maxCodeLen should probably be private. Remove deprecated package private method Base64.discardWhitespace(byte[]) Caverphone encodes names starting and ending with "mb" incorrectly. Split Caverphone class into two classes for Caverphone 1.0 and 2.0. Regression: Base64.encode(chunk=true) has bug when input length is multiple of 76. Base64 bug with empty input (new byte[0]). Make Base64 URL-safe. Allow for uppercase letters output in Hex.encodeHex(). Soundex and RefinedSoundex issues with character arrays. Base64.isArrayByteBase64() method is inefficient for large byte arrays. Thread safety and malicious code safety improvements. isBase64 throws ArrayIndexOutOfBoundsException on some non-BASE64 bytes. Fix case-insensitive string handling. Base64.encodeBase64() throws NegativeArraySizeException on large files. Implement Caverphone. Character set used by Base64 not documented. RefinedSoundex creates instance before al fields have been initialized. Digest on InputStreams. 2 Test failures in SoundexTest. Patch to add crypto-compatible BigInteger encoding support to Base64. DigestUtils: Add methods for SHA-256, SHA-384, and SHA-512. Using US_ENGLISH in Soundex caused an NullPointerException. Source tarball spews files all over the place. Base64.isArrayByteBase64() throws an ArrayIndexOutOfBoundsException for negative octets Streaming Base64 (Base64InputStream and Base64OutputStream added). Base64: Improve Code Coverage. Add methods to Base64 which work with String instead of byte[]. Base64's new constructor parameters ignored. Improve Double Metaphone test coverage. Double Metaphone bugs in alternative encoding. Make string2byte conversions indepedent of platform default encoding. BinaryCodec: Encodes and decodes binary to and from Strings of 0s and 1s. QuotedPrintableCodec: Codec for RFC 1521 MIME (Multipurpose Internet Mail Extensions) Part One. Rules #3, #4, and #5 of the quoted-printable spec are not implemented yet. See also issue CODEC-46. BCodec: Identical to the Base64 encoding defined by RFC 1521 and allows a character set to be specified. QCodec: Similar to the Quoted-Printable content-transfer-encoding defined in RFC 1521 and designed to allow text containing mostly ASCII characters to be decipherable on an ASCII terminal without decoding. Soundex: Implemented the DIFFERENCE algorithm. RefinedSoundex: Implemented the DIFFERENCE algorithm. This version is relesed under the <a href="http://www.apache.org/licenses/LICENSE-2.0">Apache License 2.0</a> , please see LICENSE.txt. Previous versions were released under the <a href="http://www.apache.org/licenses/LICENSE-1.1">Apache License 1.1</a> The Board recommendation to remove Javadoc author tags has been implemented. All author tags are now "Apache Software Foundation". The default URL encoding logic was broken. Base64 chunked encoding not compliant with RFC 2045 section 2.1 CRLF. Hex converts illegal characters to 255. Metaphone now correctly handles a silent B in a word that ends in MB. "COMB" is encoded as "KM", before this fix "COMB" was encoded as "KMB". Added missing tags in Javadoc comments. General Javadoc improvements. URLCodec: Implements the www-form-urlencoded encoding scheme. DigestUtils: Calculates MD5 and SHA digests. Modified Base64 to remedy non-compliance with RFC 2045. Non-Base64 characters were not being discarded during the decode. RFC 2045 explicitly states that all characters outside of the base64 alphabet are to be ignored. Hex.decode(Object) throws a ClassCastException when a String argument is passed in. Soundex: The HW rule is not applied; hyphens and apostrophes are not ignored. Soundex.setMaxLength causes bugs and is not needed. Calling Soundex.setMaxLength() with a value of 2 or less causes the wrong answer to be returned. Since the encoding returned by Soundex is always of length 4 by definition (we do not use the '-' in as a letter-nnn separator) the need for a maxLength attribute is not needed. Deprecate the field and accessor methods. Fix in Metaphone relating to the handling of the maximum code length. A newer version of the Base64 class reflecting improvements from both the commons-httpclient and xml-rpc versions of code forked from catalina. Base64 class from commons-httpclient in org.apache.commons.codec.base64 has been retained for backwards compatibility but has been deprecated. Soundex class from commons-util in org.apache.commons.codec. Metaphone class from commons-util in org.apache.commons.codec. RefinedSoundex class in org.apache.commons.codec. Encoder/Decoder interfaces in org.apache.commons. String and Binary specific Encoder/Decoder interfaces in org.apache.commons. StringEncoderComparator replaces the SoundexComparator from the language package. Base64 now discards whitespace characters when decoding encoded content. Base64 class from commons-httpclient in org.apache.commons.codec.base64. Soundex class from commons-util in org.apache.commons.codec. Metaphone class from commons-util in org.apache.commons.codec. SoundexComparator class from commons-util in org.apache.commons.codec.