I'm writing some unit tests to verify our handling of various resources that use character sets other than the normal Latin alphabet: Cyrillic, Hebrew, etc.


The problem I have is that I cannot find a way to embed the expectations in the test source file: here's an example of what I'm trying to do...

/// Protected: TestGetHebrewConfigString
void CPrIniFileReaderTest::TestGetHebrewConfigString()
{
    prwstring strHebrewTestFilePath = GetTestFilePath( strHebrewTestFileName );
    CPrIniFileReader prIniListReader( strHebrewTestFilePath.c_str() );
    prIniListReader.SetCurrentSection( strHebrewSubSection );

    CPPUNIT_ASSERT( prIniListReader.GetConfigString( L"דונדארןמע" ) == L"דונהשךוק" );
}


This quite simply doesn't work. Previously I worked around this using a macro that calls a routine to transform a narrow string into a wide string (we use towstring all over the place in our applications, so it's existing code):


wstring towstring( LPCSTR lpszValue )
{
    wostringstream os;
    os << lpszValue;
    return os.str();
}


The assertion in the test above then became:

CPPUNIT_ASSERT( prIniListReader.GetConfigString( UNICODE_CONSTANT( "דונדארןמע" ) ) == UNICODE_CONSTANT( "דונהשךוק" ) );

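This worked fine on OS X, but now that I'm porting to Linux I find that the tests all fail, and the whole thing feels rather hackish anyway. Can anyone tell me if they have a nicer solution to this problem?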

A tedious but portable way is to build your strings using numeric escape codes. For example:

const wchar_t *string = L"דונדארןמע";

becomes:

const wchar_t *string = L"\x05d3\x05d5\x05e0\x05d3\x05d0\x05e8\x05df\x05de\x05e2";

You have to convert all your Unicode characters to numeric escapes. That way your source code becomes encoding-independent.
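As a minimal sketch of what this looks like in practice, the Hebrew key from the question can be written with escapes only, so the source file contains nothing but ASCII. The function names below are hypothetical and not part of the original code. One thing to watch: a \x escape consumes every hex digit that follows it, so if an escaped character is immediately followed by a literal digit or a letter a-f, the literal has to be split (for example L"\x05d3" L"abc"). Standard C++ also accepts universal character names of the form \uXXXX, which are fixed-length and avoid that pitfall; the second function shows the same string in that form.

```cpp
#include <cassert>
#include <string>

// Hypothetical helper: the key "דונדארןמע" spelled with hex escapes.
// Each \x escape ends at the next backslash, so the sequence is unambiguous.
inline std::wstring HebrewKeyEscaped()
{
    return L"\x05d3\x05d5\x05e0\x05d3\x05d0\x05e8\x05df\x05de\x05e2";
}

// The same key spelled with universal character names (\uXXXX),
// which always take exactly four hex digits.
inline std::wstring HebrewKeyUcn()
{
    return L"\u05d3\u05d5\u05e0\u05d3\u05d0\u05e8\u05df\u05de\u05e2";
}

// Sanity check: both spellings yield the same nine characters.
inline void CheckEscapedForms()
{
    assert(HebrewKeyEscaped().size() == 9);
    assert(HebrewKeyEscaped() == HebrewKeyUcn());
}
```

In the test from the question, the assertion would then compare against these escaped literals instead of raw Hebrew text.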

You can use online tools for conversion, such as this one. It outputs the JavaScript escape format \uXXXX, so just search and replace \u with \x to get the C format.
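If you would rather not depend on an online tool, the conversion is small enough to script. The following is only a sketch, assuming the input text is UTF-8; ToHexEscapes is a hypothetical name, and the validation is deliberately minimal (it does not reject overlong encodings or surrogates). It decodes each UTF-8 sequence to a code point and emits it in the \xXXXX form, ready to paste inside an L"..." literal.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstdio>
#include <stdexcept>
#include <string>

// Hypothetical helper: turn UTF-8 text into a run of \xXXXX escapes.
std::string ToHexEscapes(const std::string& utf8)
{
    std::string out;
    std::size_t i = 0;
    while (i < utf8.size()) {
        const unsigned char lead = static_cast<unsigned char>(utf8[i]);
        std::uint32_t cp = 0;
        std::size_t extra = 0;  // number of continuation bytes expected

        if (lead < 0x80)                { cp = lead;        extra = 0; }
        else if ((lead & 0xE0) == 0xC0) { cp = lead & 0x1F; extra = 1; }
        else if ((lead & 0xF0) == 0xE0) { cp = lead & 0x0F; extra = 2; }
        else if ((lead & 0xF8) == 0xF0) { cp = lead & 0x07; extra = 3; }
        else throw std::invalid_argument("invalid UTF-8 lead byte");

        if (i + extra >= utf8.size())
            throw std::invalid_argument("truncated UTF-8 sequence");

        // Fold in six payload bits from each continuation byte.
        for (std::size_t k = 1; k <= extra; ++k) {
            const unsigned char b = static_cast<unsigned char>(utf8[i + k]);
            if ((b & 0xC0) != 0x80)
                throw std::invalid_argument("invalid UTF-8 continuation byte");
            cp = (cp << 6) | (b & 0x3F);
        }
        i += extra + 1;

        // Emit the code point as a backslash, 'x', and at least four hex digits.
        char buf[16];
        const int written =
            std::snprintf(buf, sizeof buf, "\\x%04x", static_cast<unsigned>(cp));
        assert(written > 0 && written < static_cast<int>(sizeof buf));
        out.append(buf, static_cast<std::size_t>(written));
    }
    return out;
}
```

Note that code points above U+FFFF produce escapes with more than four digits, which will not fit in a single wchar_t on platforms where wchar_t is 16 bits wide; Hebrew and Cyrillic are unaffected.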
