Question
I'm implementing a program that needs to serialize and deserialize large objects, so I was making some tests with the pickle, cPickle and marshal modules to choose the best one. Along the way I found something very interesting:

I'm using dumps and then loads (for each module) on a list of dicts, tuples, ints, floats and strings.
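For reference, here is a minimal sketch of that kind of benchmark, written for Python 3 (where cPickle no longer exists as a separate module, because pickle already uses the C implementation). The timing helper and the test data below are my own illustration, not the asker's original script:

```python
import marshal
import pickle
import time

def bench(name, dumps, loads, data):
    """Time a dumps/loads round trip and report the payload size."""
    t0 = time.perf_counter()
    blob = dumps(data)
    t1 = time.perf_counter()
    restored = loads(blob)
    t2 = time.perf_counter()
    assert restored == data
    print(f"{name}: dump {t1 - t0:.3f}s, load {t2 - t1:.3f}s, "
          f"serialized length {len(blob)}")
    return blob

# A list of dicts, tuples, ints, floats and strings, as in the question.
data = [({"k": i}, (i, i + 1), i, float(i), str(i)) for i in range(50_000)]

bench("pickle", pickle.dumps, pickle.loads, data)
bench("marshal", marshal.dumps, marshal.loads, data)
```

Both modules expose the same `dumps`/`loads` pair, so the same helper can time each one.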
This is the output of my benchmark:
DUMPING a list of length 7340032
----------------------------------------------------------------------
pickle => 14.675 seconds
length of pickle serialized string: 31457430
cPickle => 2.619 seconds
length of cPickle serialized string: 31457457
marshal => 0.991 seconds
length of marshal serialized string: 117440540
LOADING a list of length: 7340032
----------------------------------------------------------------------
pickle => 13.768 seconds
(same length?) 7340032 == 7340032
cPickle => 2.038 seconds
(same length?) 7340032 == 7340032
marshal => 6.378 seconds
(same length?) 7340032 == 7340032
So, from these results we can see that marshal was extremely fast in the dumping part of the benchmark: 14.8x faster than pickle and 2.6x faster than cPickle.

But, to my big surprise, marshal was by far slower than cPickle in the loading part: 2.2x faster than pickle, but 3.1x slower than cPickle.
And as for RAM, marshal was also very inefficient while loading.
I'm guessing the reason loading with marshal is so slow is somehow related to the length of its serialized string (much longer than those of pickle and cPickle).
- Why does marshal dump faster but load slower?
- Why is the marshal serialized string so long?
- Why is marshal's loading so inefficient in RAM?
- Is there a way to improve marshal's loading performance?
- Is there a way to combine marshal's fast dumping with cPickle's fast loading?
Answer
cPickle has a smarter algorithm than marshal and is able to do tricks to reduce the space used by large objects. That means it will be slower to decode but faster to encode, as the resulting output is smaller. marshal is simplistic and serializes the object straight as-is, without analyzing it any further. That also answers why marshal's loading is so inefficient: it simply has to do more work, such as reading more data from disk, to achieve the same result as cPickle.
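A quick way to see the size difference this describes (a Python 3 sketch; exact byte counts depend on the interpreter version): marshal writes small integers as fixed-width records, while pickle chooses compact opcodes for small values, so the same list usually serializes to fewer bytes with pickle.

```python
import marshal
import pickle

# The same list of smallish ints, serialized by both modules.
data = list(range(100_000))
m = marshal.dumps(data)
p = pickle.dumps(data)

print(f"marshal: {len(m)} bytes, pickle: {len(p)} bytes")

# Both round-trip faithfully; they just trade encoding effort for size.
assert marshal.loads(m) == data
assert pickle.loads(p) == data
```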
marshal and cPickle are really different things in the end: you can't get both fast saving and fast loading, since fast saving implies analyzing the data structures less, which implies saving a lot more data to disk.
Regarding the fact that marshal might be incompatible across Python versions, you should generally use cPickle:
"这不是一个通用的持久化"模块.对于Python对象通过RPC调用的通用持久化和传输,请参阅模块pickle和shelve.marshal模块的存在主要是为了支持读取和编写伪编译"代码对于 .pyc 文件的 Python 模块.因此,如果需要,Python 维护者保留以向后不兼容的方式修改 marshal 格式的权利.如果您正在序列化和反序列化 Python 对象,请改用 pickle 模块 – 性能具有可比性,版本独立性得到保证,pickle 支持的对象范围比 marshal 大得多."(关于元帅的python文档)