hadoop - Total number of replicated files after copying hdfs file into hive table -
suppose if load file in hdfs hive table total replicas of file. in hdfs file replicated 3 times , copying hive table results in additional replicas sums 6 replicas or not??
in hdfs, number of replicas based on replication factor set. in case, since replication factor 3, there 3 copies.
when sqoop import hdfs hive(into internal table), data copied 1 location on hdfs table in hive. replication of hive data again happens based on replication factor.
in total end 3(hdfs) + 1(hive copy)*3 => 3copies on hdfs , 3 copies of data stored hive(this not 6 copies, hive doesn't store data in same file format).
or
if load data inpath
internal table old copy lost , newer hive copy exists. end hive table(and replicated copies).
in case, 3 hive table copies(as rep set 3).
or
if create external table, no new copy created. meta of data created hive. end hdfs copies + hive meta storage copies.
in case, 3 copies in hdfs + 3 copies of meta data stored on hive.
Comments
Post a Comment