This document contains information about Apache HBase including links to documentation pages, JIRA issues, and discussions on using HBase. It provides configuration examples for viewing HFile contents, explains how Bloom filters are used in HBase, includes an overview of the HBase data model and comparisons with RDBMS. It also shows an example Git diff of modifying the HBase heap size configuration and provides links to guides on using HBase and documentation on region splitting and merging.
28. To view a textualized version of hfile content, you can do use
theorg.apache.hadoop.hbase.io.hfile.HFile tool.
Type the following to see usage:
$ ${HBASE_HOME}/bin/hbase org.apache.hadoop.hbase.io.hfile.HFile
For example, to view the content of the
filehdfs://10.81.47.41:8020/hbase/TEST/1418428042/DSMP/475950861828
6845475, type the following:
$ ${HBASE_HOME}/bin/hbase org.apache.hadoop.hbase.io.hfile.HFile -v -f
hdfs://10.81.47.41:8020/hbase/TEST/1418428042/DSMP/4759508618286845475
If you leave off the option -v to see just a summary on the hfile.
29. Burton Howard Bloom [1]
probabilistic data structure
element set
http://www.quora.com/How-are-bloom-filters-used-in-HBase
50. $ git diff hbase-env.sh
diff --git a/conf/hbase-env.sh b/conf/hbase-env.sh
index e70ebc6..96f8c27 100644
--- a/conf/hbase-env.sh
+++ b/conf/hbase-env.sh
@@ -31,7 +31,7 @@ export JAVA_HOME=/usr/lib//jvm/java-6-sun/
# export HBASE_CLASSPATH=
# The maximum amount of heap to use, in MB. Default is 1000.
-# export HBASE_HEAPSIZE=1000
+export HBASE_HEAPSIZE=8192
# Extra Java runtime options.
# Below are what we set by default. May only work with SUN JVM.