c# - Process.StandardOutput Read method returns empty (sometimes)

I'm using wkhtmltopdf to generate PDF files from HTML strings. The code is pretty much the following:

// ...
processStartInfo.UseShellExecute = false;
processStartInfo.CreateNoWindow = true;
processStartInfo.RedirectStandardInput = true;
processStartInfo.RedirectStandardOutput = true;
processStartInfo.RedirectStandardError = true;

// ...
process = Process.Start(processStartInfo);
using (StreamWriter streamWriter = process.StandardInput)
{
    streamWriter.AutoFlush = true;
    streamWriter.Write(htmlCode);
}

byte[] buffer = new byte[32768], file;
using (var memoryStream = new MemoryStream())
{
    while (true)
    {
        int read = process.StandardOutput.BaseStream.Read(buffer, 0, buffer.Length);
        if (read <= 0)
            break;
        memoryStream.Write(buffer, 0, read);
    }
    file = memoryStream.ToArray();
}

process.WaitForExit(60000);
process.Close();

return file;

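This works as expected, but for one specific piece of HTML the very first call to StandardOutput.BaseStream.Read returns zero bytes (nothing is read into the buffer), and StandardOutput.EndOfStream is also true in that case.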

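Normally I would suspect that the wkhtmltopdf tool fails to process the HTML input for some reason, but this happens in only about two out of five attempts, so I now suspect it has something to do with process buffering and how the output stream is read. However, I can't work out what exactly the problem is.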

What could cause this behavior?

Update

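Reading StandardError is the obvious thing to try, but it doesn't help: it is always an empty string. Neither does process.ExitCode (-1073741819), which as far as I know just means "the process crashed".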
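A side note on that exit code: -1073741819 is the signed 32-bit form of 0xC0000005, the Windows NTSTATUS value for an access violation, which suggests a crash inside the native process rather than a problem with how the stream is read. A minimal sketch of the conversion follows; the `ExitCodeFormatter` class and `ToNtStatusHex` method are hypothetical names introduced only for illustration.

```csharp
using System;

// Hypothetical helper, not part of the original code.
public static class ExitCodeFormatter
{
    // Reinterprets the signed Process.ExitCode as an unsigned 32-bit value
    // and formats it as hex, the form in which NTSTATUS codes are usually listed.
    public static string ToNtStatusHex(int exitCode)
    {
        return "0x" + unchecked((uint)exitCode).ToString("X8");
    }
}
```

Calling `ExitCodeFormatter.ToNtStatusHex(-1073741819)` returns "0xC0000005". Note that process.ExitCode can only be read after the process has exited and before process.Close() is called.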

Best answer

After nearly a year of running wkhtmltopdf in production, the problem described above has been reported no more than five times so far.

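The problem usually goes away when a DIV is added somewhere near the end of the document, with a height (e.g. 20px) large enough to push the last line of text onto the next page if the page happens to be exactly full.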
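The spacer DIV workaround could be sketched roughly as below. This is only an illustration under assumptions: the `HtmlPadding` class, the `AppendSpacerDiv` method and the inline style are hypothetical, and the answer does not say exactly where or how the DIV was inserted.

```csharp
using System;
using System.Globalization;

// Hypothetical helper, not taken from the original answer.
public static class HtmlPadding
{
    // Inserts an empty DIV of the given height just before the last </body> tag,
    // or appends it to the end of the string if no closing body tag is found.
    public static string AppendSpacerDiv(string html, int heightPx = 20)
    {
        string height = heightPx.ToString(CultureInfo.InvariantCulture);
        string spacer = "<div style=\"height:" + height + "px\"></div>";
        int index = html.LastIndexOf("</body>", StringComparison.OrdinalIgnoreCase);
        return index >= 0 ? html.Insert(index, spacer) : html + spacer;
    }
}
```

With the code from the question, this would be applied to htmlCode before it is written to the process's standard input, for example `streamWriter.Write(HtmlPadding.AppendSpacerDiv(htmlCode))`.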

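We know that the tool sometimes fails to split the HTML content into pages correctly, because in such cases it produced (for example) seven pages while the page numbering reported only six, so the last page was numbered "7 of 6". We think that perhaps it sometimes fails completely and cannot generate the pages at all. The documents are generated from highly dynamic HTML content. It is relatively easy to make a change that leaves the content slightly shorter or longer without using the dummy DIV, and that is how we have worked around the error so far.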
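Since the tool can apparently fail without writing anything to StandardError, one defensive option is to validate the bytes that were read before returning them. The sketch below relies only on the fact that PDF files begin with the ASCII signature "%PDF-"; the `PdfOutputCheck` class and `LooksLikePdf` method are hypothetical names, and this is not something the answer says was actually deployed.

```csharp
using System;

// Hypothetical helper, not taken from the original answer.
public static class PdfOutputCheck
{
    // ASCII bytes of "%PDF-", the signature at the start of a PDF file.
    private static readonly byte[] Signature = { 0x25, 0x50, 0x44, 0x46, 0x2D };

    // Returns true only if the data is non-empty and starts with the PDF signature.
    public static bool LooksLikePdf(byte[] data)
    {
        if (data == null || data.Length < Signature.Length)
            return false;

        for (int i = 0; i < Signature.Length; i++)
        {
            if (data[i] != Signature[i])
                return false;
        }
        return true;
    }
}
```

In the code from the question, the caller could check `PdfOutputCheck.LooksLikePdf(file)` after the read loop and treat a false result (or a non-zero exit code) as a failed conversion to log or retry, instead of returning an empty file.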

We are testing puppeteer now.

For c# - Process.StandardOutput Read method returns empty (sometimes), a similar question can be found on Stack Overflow: https://stackoverflow.com/questions/46707977/