| author | Mike Buland <eichlan@xagasoft.com> | 2011-04-04 14:59:13 +0000 | 
|---|---|---|
| committer | Mike Buland <eichlan@xagasoft.com> | 2011-04-04 14:59:13 +0000 | 
| commit | 6ff23ac5097f7d92ac8840c2ba17d1dbf1eb80a5 (patch) | |
| tree | fc70404d66854bba713bff2350f5f69f43bd85bc /test.utf16le | |
| parent | abbf45c1da7f3e3a542e6c6339a1bab31283f22e (diff) | |
| download | libbu++-6ff23ac5097f7d92ac8840c2ba17d1dbf1eb80a5.tar.gz libbu++-6ff23ac5097f7d92ac8840c2ba17d1dbf1eb80a5.tar.bz2 libbu++-6ff23ac5097f7d92ac8840c2ba17d1dbf1eb80a5.tar.xz libbu++-6ff23ac5097f7d92ac8840c2ba17d1dbf1eb80a5.zip | |
UtfString is going really well.  It can now parse UTF-8, UTF-16 (le,be), and
UTF-32 (le,be).  The internal storage seems to be working fine.  Random access
is still a problem, but at least we can tell which half of a surrogate pair
we're on, so we can always rapidly determine the entire code point from any
UTF-16 index that we're on.
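
To illustrate the surrogate-pair point, here is a minimal sketch of recovering the full code point from an arbitrary UTF-16 index. It is not taken from UtfString: the helper names (`isHighSurrogate`, `isLowSurrogate`, `codePointAt`) and the use of `std::vector` as storage are assumptions made for the example, as is the choice to return U+FFFD for an unpaired surrogate.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helpers for illustration; not the libbu++ UtfString API.
inline bool isHighSurrogate(std::uint16_t u) { return u >= 0xD800 && u <= 0xDBFF; }
inline bool isLowSurrogate(std::uint16_t u)  { return u >= 0xDC00 && u <= 0xDFFF; }

// Returns the code point that covers index i (i must be < s.size()).
// Works whether i lands on the first or the second half of a pair.
// Assumption: an unpaired surrogate is reported as U+FFFD.
inline std::uint32_t codePointAt(const std::vector<std::uint16_t>& s, std::size_t i)
{
    std::uint16_t u = s[i];

    // Landed on the second half: step back to the first half.
    if (isLowSurrogate(u) && i > 0 && isHighSurrogate(s[i - 1]))
    {
        --i;
        u = s[i];
    }

    if (isHighSurrogate(u))
    {
        if (i + 1 < s.size() && isLowSurrogate(s[i + 1]))
        {
            // Combine the 10 payload bits of each half.
            return 0x10000u
                + ((static_cast<std::uint32_t>(u) - 0xD800u) << 10)
                + (static_cast<std::uint32_t>(s[i + 1]) - 0xDC00u);
        }
        return 0xFFFDu; // high surrogate with no partner
    }

    if (isLowSurrogate(u))
        return 0xFFFDu; // low surrogate with no partner

    return u; // ordinary BMP code unit
}
```

Because the two surrogate ranges do not overlap, the lookup never needs to inspect more than one neighbouring code unit, which is why it stays fast even without true random access by code point.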
The only optimization that I'm not doing yet is reading in entire 16-bit or
32-bit words at a time and converting them from their byte order to native.
There are a few potential issues with that, so we'll see.
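
A rough sketch of what that word-at-a-time read could look like for the 16-bit case follows. The commit does not say which issues are meant; the sketch assumes two common ones, unaligned input and an odd trailing byte, and sidesteps them by copying with `memcpy` and ignoring the leftover byte. All names here (`hostIsLittleEndian`, `swap16`, `readUtf16Words`) are hypothetical and not part of libbu++.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical helpers for illustration; not the libbu++ API.

// Detects the host byte order at run time by inspecting the first byte
// of a known 16-bit value.
inline bool hostIsLittleEndian()
{
    const std::uint16_t probe = 1;
    unsigned char first = 0;
    std::memcpy(&first, &probe, 1);
    return first == 1;
}

// Swaps the two bytes of a 16-bit word.
inline std::uint16_t swap16(std::uint16_t v)
{
    return static_cast<std::uint16_t>((v << 8) | (v >> 8));
}

// Copies the input as whole 16-bit words, then swaps each word only if
// the source byte order differs from the host's.
// Assumption: a trailing odd byte is silently ignored.
inline std::vector<std::uint16_t> readUtf16Words(
    const unsigned char* data, std::size_t len, bool srcLittleEndian)
{
    std::vector<std::uint16_t> out(len / 2);

    // memcpy avoids dereferencing a possibly misaligned uint16_t pointer.
    if (!out.empty())
        std::memcpy(out.data(), data, out.size() * 2);

    if (srcLittleEndian != hostIsLittleEndian())
    {
        for (auto& w : out)
            w = swap16(w);
    }
    return out;
}
```

When the source and host byte orders match, the swap loop is skipped entirely and the read reduces to a single bulk copy, which is where the gain over byte-by-byte assembly would come from.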
I added a couple of test data files and a test program; I'll delete them all
just as soon as it's verified to write correctly.
Diffstat (limited to '')
| -rw-r--r-- | test.utf16le | 1 | 
1 files changed, 1 insertions, 0 deletions
| diff --git a/test.utf16le b/test.utf16le new file mode 100644 index 0000000..9f610d6 --- /dev/null +++ b/test.utf16le | |||
| @@ -0,0 +1 @@ | |||
| ÿþ¥Ëæ˜)=Ø<ÞÿÛýßH$ \ No newline at end of file | |||
