Crime Analysis Using Pig and Hadoop

Authors

  • Indra Wicaksono, Sriwijaya University

Abstract

This research processes and analyzes crime data using Apache Pig on Hadoop. Apache Pig lets analysts focus on analyzing bulk data sets and spend less time writing MapReduce programs. Like pigs, which eat anything, Pig is designed to work on any kind of data. It is a high-level platform for analyzing very large datasets, was developed at Yahoo!, and is generally used with Hadoop to perform a wide range of data administration operations. For writing data analysis programs, Pig provides a high-level language called Pig Latin. Pig Latin offers several operators with which programmers can develop their own functions for reading, writing, and processing data. In this study, three test scenarios were carried out: (i) the first test analyzes the cluster, (ii) the second test compiles the data using an Apache big data cluster, and (iii) the third test processes the datasets with Pig. The result of the research is an analysis of data on violence that occurred in Austin, the capital of Texas, in 2014-2015. The analysis covers several types of crime, such as theft. The analysis was performed on a cluster configured on the Cloudera platform.

Published

2023-08-01