Analysis and Experimental Study of HDFS Performance Cover Image

Analysis and Experimental Study of HDFS Performance
Analysis and Experimental Study of HDFS Performance

Author(s): Yordan Kalmukov, Milko MARINOV, Tsvetelina Mladenova, Irena VALOVA
Subject(s): ICT Information and Communications Technologies
Published by: UIKTEN - Association for Information Communication Technology Education and Science
Keywords: HDFS; Distributed file systems; Distributed and parallel computing; Hadoop cluster

Summary/Abstract: In the age of big data, the amount of data that people generate and use on a daily basis has far exceeded the storage and processing capabilities of a single computer system. That motivates the use of distributed big data storage and processing system such as Hadoop. It provides a reliable, horizontallyscalable, fault-tolerant and efficient service, based on the Hadoop Distributed File System (HDFS) and MapReduce. The purpose of this research is to experimentally determine whether (and to what extent) the network communication speed, the file replication factor, the files’ sizes and their number, and the location of the HDFS client influence the performance of the HDFS read/write operations.

  • Issue Year: 10/2021
  • Issue No: 2
  • Page Range: 806-814
  • Page Count: 9
  • Language: English
Toggle Accessibility Mode