hadoop - Total number of replicated files after copying hdfs file into hive table -


suppose if load file in hdfs hive table total replicas of file. in hdfs file replicated 3 times , copying hive table results in additional replicas sums 6 replicas or not??

in hdfs, number of replicas based on replication factor set. in case, since replication factor 3, there 3 copies.

when sqoop import hdfs hive(into internal table), data copied 1 location on hdfs table in hive. replication of hive data again happens based on replication factor.

in total end 3(hdfs) + 1(hive copy)*3 => 3copies on hdfs , 3 copies of data stored hive(this not 6 copies, hive doesn't store data in same file format).

or

if load data inpath internal table old copy lost , newer hive copy exists. end hive table(and replicated copies).

in case, 3 hive table copies(as rep set 3).

or

if create external table, no new copy created. meta of data created hive. end hdfs copies + hive meta storage copies.

in case, 3 copies in hdfs + 3 copies of meta data stored on hive.


Comments

Popular posts from this blog

mysql - Dreamhost PyCharm Django Python 3 Launching a Site -

java - Sending SMS with SMSLib and Web Services -

java - How to resolve The method toString() in the type Object is not applicable for the arguments (InputStream) -