python - File stuck at ._COPYING_ in HDFS
I am trying to parse a file using Python and store the result in HDFS. The code was working fine a couple of days ago, but since this morning, although I get no error while executing the code, the HDFS directory only contains data._COPYING_. It never moves on from the copying state. What am I doing wrong?
Code:

    import subprocess
    from subprocess import Popen, PIPE

    cat = subprocess.Popen(["hadoop", "fs", "-cat", "/user/cloudera/pin/*"], stdout=subprocess.PIPE)
    dumpoff = Popen(["hadoop", "fs", "-put", "-", "/user/cloudera/data"], stdin=PIPE)
    obrind = "0"
    line1 = ""
    for line in cat.stdout:
        code = line.split('|')[1]
        idval = line.split('|')[2]
        if (code == "ob"):
            obrind = runnno  # runnno is not defined anywhere in this snippet
        line1 = line.strip() + "|" + "ob_" + obrind
        dumpoff.stdin.write(line1)
        print(line1)
Update:

    Configured Capacity: 424169496576 (395.04 GB)
    Present Capacity: 348671389696 (324.73 GB)
    DFS Remaining: 153408200704 (142.87 GB)
    DFS Used: 195263188992 (181.85 GB)
    DFS Used%: 56.00%
    Under replicated blocks: 0
    Blocks with corrupt replicas: 0
    Missing blocks: 0
    Missing blocks (with replication factor 1): 0
    -------------------------------------------------
    report: Access denied for user cloudera. Superuser privilege required
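One thing that may be worth checking, offered as a sketch rather than a confirmed diagnosis: `hadoop fs -put -` reads from standard input and writes to a temporary `._COPYING_` file, which it renames to the final name only once the input ends. The code in the question never closes `dumpoff.stdin` or waits for the process, and it writes the stripped lines without a trailing newline. The version below assumes Python 3.7+ (for `text=True`). The helper names `tag_line` and `pipe_lines` are hypothetical, and the `obrind` update logic is left out because `runnno` is not defined in the question.

```python
import subprocess


def tag_line(line, obrind):
    """Append the "|ob_<obrind>" suffix from the question and restore the newline."""
    return line.strip() + "|ob_" + obrind + "\n"


def pipe_lines(source_cmd, sink_cmd, obrind="0"):
    """Stream lines from source_cmd into sink_cmd, then close stdin and wait.

    Returns the exit codes of both processes as (source, sink).
    """
    source = subprocess.Popen(source_cmd, stdout=subprocess.PIPE, text=True)
    sink = subprocess.Popen(sink_cmd, stdin=subprocess.PIPE, text=True)
    try:
        for line in source.stdout:
            sink.stdin.write(tag_line(line, obrind))
    finally:
        # Closing stdin is what tells the receiving process the input is complete.
        sink.stdin.close()
        source.stdout.close()
    # Wait for both processes so failures show up as non-zero exit codes.
    return source.wait(), sink.wait()


# Intended use with the commands from the question (requires a Hadoop client):
# codes = pipe_lines(
#     ["hadoop", "fs", "-cat", "/user/cloudera/pin/*"],
#     ["hadoop", "fs", "-put", "-", "/user/cloudera/data"],
# )
```

A non-zero second exit code would suggest that the `-put` itself failed, in which case its stderr output is the next place to look.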