Big Data Analysis Using HDFS (Analisa Big Data Menggunakan HDFS)

Authors

  • Muhammad Shafa Zauhair Adinata, Sriwijaya University
  • Sari Nurhaliza
  • Muhammad Daffa Zamzola
  • Ririn Purnama Sari
  • Munawirul Akmal
  • Tria Lailani

Keywords:

Big Data

Abstract

Big data is data characterized by large volume, wide variety, and very fast data flow. Examples include social media data and Google search queries. Such data is available at any time and can be used, for example, to track disease activity. Processing big data is not easy, so tools are needed to support it. One such tool is Hadoop. However, data processing with Hadoop alone has not reached its full potential, and faster processing is required. One way to increase processing speed is to apply Apache Spark to data stored in HDFS (Hadoop Distributed File System).

In this big data analysis, we use HDFS to increase data processing speed, Hadoop as a framework for storing data sets, and Apache Hive as a data warehouse system on top of Apache Hadoop for data summarization and analysis. The analysis runs on CentOS, a Linux distribution that serves as the operating system for the computation. CentOS itself is run in VMware, which acts as the virtualization tool for running the operating system and the data analysis.
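The idea of storing a data set in blocks and then summarizing it can be illustrated with a minimal sketch in plain Python. It does not use Hadoop, Hive, or Spark; it only mimics the pattern of splitting data into blocks, counting within each block, and merging the partial results. The function names, the block size, and the sample search queries are all hypothetical and are not taken from the study.

```python
from collections import Counter
from functools import reduce


def split_into_blocks(records, block_size):
    """Split records into fixed-size blocks, loosely mimicking how a
    distributed file system stores a file as a sequence of blocks."""
    return [records[i:i + block_size] for i in range(0, len(records), block_size)]


def map_block(block):
    """Map step: count how often each term occurs within one block."""
    return Counter(block)


def reduce_counts(left, right):
    """Reduce step: merge two partial counts into one."""
    return left + right


def summarize(records, block_size=3):
    """Summarize records by counting terms block by block, then merging."""
    blocks = split_into_blocks(records, block_size)
    partials = [map_block(block) for block in blocks]
    return reduce(reduce_counts, partials, Counter())


# Hypothetical sample data: search queries related to disease activity.
queries = ["flu", "fever", "flu", "cough", "flu", "fever", "cough"]
totals = summarize(queries)
print(totals.most_common())
```

In a real deployment, each block would be processed on the node that stores it, and a summarization like this would typically be expressed as a grouped count in Hive or Spark rather than written by hand.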

 

Keywords: Big Data, Hadoop, HDFS, Apache Hive, CentOS

Published

2022-10-30