diff options
| author | Mike Buland <eichlan@xagasoft.com> | 2010-11-19 05:54:14 +0000 |
|---|---|---|
| committer | Mike Buland <eichlan@xagasoft.com> | 2010-11-19 05:54:14 +0000 |
| commit | 7c335ede527eaf4a3053ef35b1299141d34aaf40 (patch) | |
| tree | 5ececc9181090cce540a5e4fbe28eda97e4e5c2b /src/utfstring.h | |
| parent | 2fe32ba19571ff775a55f61eca355a46f269393e (diff) | |
| download | libbu++-7c335ede527eaf4a3053ef35b1299141d34aaf40.tar.gz libbu++-7c335ede527eaf4a3053ef35b1299141d34aaf40.tar.bz2 libbu++-7c335ede527eaf4a3053ef35b1299141d34aaf40.tar.xz libbu++-7c335ede527eaf4a3053ef35b1299141d34aaf40.zip | |
I now think that this may not work out at all. It looks like if we want proper
Unicode handling we'll need to implement a series of codecs and converters as
well as tables of codepages and lookups. It'll be interesting, I guess, but
it makes me care a lot less about proper encoding. Anyway, UtfString uses
shorts instead of chars, so it's a step in the right direction, but still not
enough to be able to handle proper UTF-16 encoding, maybe UCS-2 encoding, but...
...that's lame. Bu::FBasicString has been generalized a bit with optimizations
from libc for char based strings. It also, unfortunately, still uses char-only
functions in several places, those all rely on char casting strings at the
moment just to get the thing to compile. Basically, it's not a good UTF-16
solution yet, and it may never be and remain compatible with char based strings.
Diffstat (limited to '')
| -rw-r--r-- | src/utfstring.h | 24 |
1 files changed, 24 insertions, 0 deletions
diff --git a/src/utfstring.h b/src/utfstring.h new file mode 100644 index 0000000..bbacb74 --- /dev/null +++ b/src/utfstring.h | |||
| @@ -0,0 +1,24 @@ | |||
| 1 | #ifndef BU_UTF_STRING_H | ||
| 2 | #define BU_UTF_STRING_H | ||
| 3 | |||
| 4 | #include "bu/fbasicstring.h" | ||
| 5 | |||
| 6 | namespace Bu | ||
| 7 | { | ||
| 8 | typedef FBasicString<short> UtfString; | ||
| 9 | |||
| 10 | template<typename T> | ||
| 11 | uint32_t __calcHashCode( const T &k ); | ||
| 12 | |||
| 13 | template<typename T> | ||
| 14 | bool __cmpHashKeys( const T &a, const T &b ); | ||
| 15 | |||
| 16 | template<> uint32_t __calcHashCode<UtfString>( const UtfString &k ); | ||
| 17 | template<> bool __cmpHashKeys<UtfString>( | ||
| 18 | const UtfString &a, const UtfString &b ); | ||
| 19 | |||
| 20 | template<typename t> void __tracer_format( const t &v ); | ||
| 21 | template<> void __tracer_format<UtfString>( const UtfString &v ); | ||
| 22 | } | ||
| 23 | |||
| 24 | #endif | ||
