Problem Description
I have a table that is subject to heavy insert and delete action, and I need to scan it frequently with Scans (only by row-key, no column values).
I noticed that Scan latency increases as the amount of data in the table grows. After closer inspection of ScanMetrics, I noticed that for most higher-latency scans, ScanMetrics.countOfRowsFiltered is MUCH higher than the number of rows that I'm actually requesting to scan (a limit I specify both with .setLimit() on the Scan and with a PageFilter in the FilterList that I set on the scan).
What exactly does countOfRowsFiltered represent? In my testing environments, I can never reproduce a situation where the number of rows scanned is higher than the limit I set, so countOfRowsFiltered is always zero. But in the real environment it is frequently quite high (and according to my calculations, this may be the reason for the gradual increase in overall scan latency).
I can't find any description of this measure out there. Any experience with it, and how to minimize it?
I set up the scan as follows:
import org.apache.hadoop.hbase.client.Result;
import org.apache.hadoop.hbase.client.ResultScanner;
import org.apache.hadoop.hbase.client.Scan;
import org.apache.hadoop.hbase.client.Scan.ReadType;
import org.apache.hadoop.hbase.client.metrics.ScanMetrics;
import org.apache.hadoop.hbase.filter.FilterList;
import org.apache.hadoop.hbase.filter.FirstKeyOnlyFilter;
import org.apache.hadoop.hbase.filter.KeyOnlyFilter;
import org.apache.hadoop.hbase.filter.PageFilter;
import org.apache.hadoop.hbase.filter.PrefixFilter;

Scan scan = new Scan().withStartRow(rowKeyStart).withStopRow(rowKeyStop);
scan.setCaching(scanCache);
FilterList filterList = new FilterList(
        FilterList.Operator.MUST_PASS_ALL,
        new FirstKeyOnlyFilter(),   // return only the first KV of each row
        new KeyOnlyFilter(),        // strip values, keep keys only
        new PrefixFilter(myPrefix),
        new PageFilter(limit));
scan.setFilter(filterList);
scan.setCacheBlocks(false);
scan.setLimit(limit);
scan.setReadType(ReadType.PREAD);
scan.setScanMetricsEnabled(true);

ResultScanner scanner = myTable.getScanner(scan);
int processed = 0;
for (Result row : scanner.next(limit)) {
    // do something with this row
    if (++processed >= limit)
        break;
}
ScanMetrics sm = scanner.getScanMetrics();
long scanned = sm.countOfRowsScanned.get();
long filtered = sm.countOfRowsFiltered.get(); // WHAT IS THIS???
scanner.close();
Recommended Answer
I believe I have found the answer:
I was performing Deletes by specifying only the rowKey (even though I only have one column in the row). In this case, a delete marker is put on the row and the row is excluded from all scans and gets, BUT it remains physically present in the underlying infrastructure even after major compactions. This way the Scan spends extra time iterating through those deleted rows and filtering them out to prepare the final result that excludes them.
It looks like the row only gets removed from the underlying infrastructure if the Delete is fully qualified by the RowKey, ColumnFamily, ColumnName, AND TimeStamp of ALL of its columns.
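For illustration, here is a minimal sketch of the difference, assuming a Connection and a single column cf:q (the table, family, and qualifier names are placeholders, not from the original post). The extra Get fetches the cell's timestamp so the Delete can name it exactly:

import java.io.IOException;
import org.apache.hadoop.hbase.TableName;
import org.apache.hadoop.hbase.client.Connection;
import org.apache.hadoop.hbase.client.Delete;
import org.apache.hadoop.hbase.client.Get;
import org.apache.hadoop.hbase.client.Result;
import org.apache.hadoop.hbase.client.Table;
import org.apache.hadoop.hbase.util.Bytes;

static void deleteFullyQualified(Connection conn, byte[] rowKey) throws IOException {
    byte[] family = Bytes.toBytes("cf");   // placeholder column family
    byte[] qualifier = Bytes.toBytes("q"); // placeholder qualifier
    try (Table table = conn.getTable(TableName.valueOf("my_table"))) {
        // Fetch the cell's timestamp so the Delete can be fully qualified.
        Result r = table.get(new Get(rowKey).addColumn(family, qualifier));
        if (r.isEmpty()) return;
        long ts = r.getColumnLatestCell(family, qualifier).getTimestamp();

        // RowKey + ColumnFamily + ColumnName + TimeStamp:
        table.delete(new Delete(rowKey).addColumn(family, qualifier, ts));

        // By contrast, table.delete(new Delete(rowKey)) only places a
        // row-level delete marker on the row.
    }
}

Note the extra read adds a round-trip per delete; if you already know the write timestamp, you can skip the Get.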
FURTHERMORE: it seems that it's not sufficient to just run a major compaction. First the table needs to be flushed, and THEN major-compacted; only then are the deleted rows fully gone, and the Scan doesn't spend extra time filtering them out.
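A minimal sketch of that flush-then-compact sequence with the Admin API (the table name is a placeholder; note that majorCompact only requests the compaction asynchronously, so you may have to wait for it to complete):

import java.io.IOException;
import org.apache.hadoop.hbase.TableName;
import org.apache.hadoop.hbase.client.Admin;
import org.apache.hadoop.hbase.client.Connection;

static void flushThenMajorCompact(Connection conn) throws IOException {
    TableName table = TableName.valueOf("my_table"); // placeholder
    try (Admin admin = conn.getAdmin()) {
        admin.flush(table);        // first persist MemStore contents to HFiles
        admin.majorCompact(table); // then request the major compaction (async)
    }
}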
This is harder than I thought...