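Credo Systemz: Top Hadoop Interview Questions and Answers

What is Apache Hadoop?
Apache Hadoop is an open source software framework for distributed storage and distributed processing of large data sets. Open source means it is freely available, and we can even change its source code as per our requirements. Hadoop makes it possible to run applications on a system with thousands of commodity hardware nodes. Its distributed file system has provision for rapid data transfer rates among nodes, and it also allows the system to continue operating in case of node failure.

What are the main components of Hadoop?
The main components are the storage layer (HDFS), the batch processing engine (MapReduce), and the resource management layer (YARN).

HDFS is the storage unit of Hadoop, responsible for storing different kinds of data as blocks in a distributed environment. It follows a master-slave topology, and its components are the NameNode and the DataNode.

MapReduce performs parallel processing across the cluster and is used for the analysis of large data sets. It uses a two-step map and reduce process.

YARN (Yet Another Resource Negotiator) manages resources and provides an execution environment for processes. Its components are the ResourceManager and the NodeManager.

Why do we need Hadoop?
Storage: since the data is very large, storing such a huge amount of data is difficult.
Security: since the data is huge in size, keeping it secure is a challenge.
Analytics: in big data, most of the time we are unaware of the kind of data we are dealing with, so analyzing that data is more difficult.
Data quality: big data is messy, inconsistent, and incomplete.
Discovery: using a powerful algorithm ...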
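
The two-step map and reduce process mentioned above can be illustrated with a small word count written in plain Python. This is only a sketch of the idea, not Hadoop's actual API: the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) and the sample input are assumptions made for illustration, and everything runs in a single process rather than across a cluster.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(line: str) -> Iterator[Tuple[str, int]]:
    """Map step: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Group all values by key, standing in for the framework's shuffle."""
    grouped: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return dict(grouped)


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Reduce step: collapse all counts for one word into a single total."""
    return key, sum(values)


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle, and reduce in sequence over the input lines."""
    mapped = (pair for line in lines for pair in map_phase(line))
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    # Hypothetical sample input; on a real cluster each line could be
    # handled by a different mapper working on a different HDFS block.
    sample = ["big data needs big storage", "hadoop stores big data"]
    print(word_count(sample))
```

In a real Hadoop job the map calls would run in parallel on the nodes that hold the data blocks, and the grouping between the two steps would be done by the framework rather than by a local dictionary.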