NSLog() vs printf() when printing C strings (UTF-8)



I have noticed that if I print a byte array containing the UTF-8 representation of a string with the format specifier "%s", printf() gets it right but NSLog() garbles it (each byte is printed as a separate character, so "¥", for example, comes out as the two characters "¬•"). This is curious, because I always thought that NSLog() was just printf(), plus:

  1. The first parameter (the "format") is an Objective-C string, not a C string (hence the "@").
  2. The timestamp and app name are prepended.
  3. A newline is automatically added at the end.
  4. The ability to print Objective-C objects (using the format "%@").


NSString* string;

// (...fill string with unicode string...)

const char* stringBytes = [string cStringUsingEncoding:NSUTF8Encoding];

NSUInteger stringByteLength = [string lengthOfBytesUsingEncoding:NSUTF8Encoding];
stringByteLength += 1; // add room for '\0' terminator

char* buffer = calloc(stringByteLength, sizeof(char)); // calloc takes (count, size)

memcpy(buffer, stringBytes, stringByteLength);

NSLog(@"Buffer after copy: %s", buffer);
// (renders ascii, no matter what)

printf("Buffer after copy: %s\n", buffer);
// (renders correctly, e.g. japanese text)


Somehow, it looks as if printf() is "smarter" than NSLog(). Does anyone know the underlying cause, and whether this behavior is documented anywhere? (I couldn't find anything.)



NSLog() and stringWithFormat: seem to expect the string for %s to be in the "system encoding" (for example, "Mac Roman" on my computer):

NSString *string = @"¥";
NSStringEncoding enc = CFStringConvertEncodingToNSStringEncoding(CFStringGetSystemEncoding());
const char* stringBytes = [string cStringUsingEncoding:enc];
NSString *log = [NSString stringWithFormat:@"%s", stringBytes];
NSLog(@"%@", log);

// Output: ¥


Of course, this will fail if some characters are not representable in the system encoding. I could not find any official documentation for this behavior, but it is clear that %s in stringWithFormat: or NSLog() does not work reliably with arbitrary UTF-8 strings.
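
To see where the "¬•" in the question comes from, here is a minimal sketch in plain C (it should also compile unchanged in an Objective-C file). It assumes the system encoding is Mac Roman, as on my machine. The helper names mac_roman_to_utf8 and decode_as_mac_roman are made up for this illustration, and the table covers only the two bytes that make up "¥" in UTF-8, not the full Mac Roman code page.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper: UTF-8 text for the character that Mac Roman assigns
 * to a non-ASCII byte. Only the two bytes needed here are mapped; every
 * other byte yields NULL. */
static const char *mac_roman_to_utf8(unsigned char byte)
{
    switch (byte) {
    case 0xC2: return "\xC2\xAC";     /* U+00AC NOT SIGN */
    case 0xA5: return "\xE2\x80\xA2"; /* U+2022 BULLET */
    default:   return NULL;
    }
}

/* Decode `in` one byte at a time as if it were Mac Roman and write the
 * result to `out` as UTF-8. ASCII bytes pass through unchanged, unmapped
 * bytes become '?'. Stops early if `out` is too small; the output is
 * always NUL-terminated. */
static void decode_as_mac_roman(const char *in, char *out, size_t out_size)
{
    assert(in != NULL && out != NULL && out_size > 0);

    size_t used = 0;
    out[0] = '\0';
    for (const unsigned char *p = (const unsigned char *)in; *p != '\0'; ++p) {
        char ascii[2] = { (char)*p, '\0' };
        const char *piece = (*p < 0x80) ? ascii : mac_roman_to_utf8(*p);
        if (piece == NULL) {
            piece = "?";
        }
        size_t len = strlen(piece);
        if (used + len + 1 > out_size) {
            break; /* not enough room for this character plus the NUL */
        }
        memcpy(out + used, piece, len);
        used += len;
        out[used] = '\0';
    }
}
```

"¥" is U+00A5, which UTF-8 encodes as the two bytes 0xC2 0xA5. Calling decode_as_mac_roman("\xC2\xA5", out, sizeof out) leaves the UTF-8 bytes for "¬•" in out, which matches the garbled NSLog() output described in the question.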


If you want to check the contents of a char buffer containing a UTF-8 string, then this will work with arbitrary characters (using the boxed expression syntax to create an NSString from a UTF-8 string):

NSLog(@"%@", @(utf8Buffer));
