Hadoop MapReduce with Java: parse and clean the given JSON dataset and write the output to HDFS
$10-30 USD
Completed
Posted almost 2 years ago
Paid on delivery
Parse, clean, and profile the attached file [login to view URL] by removing hashtags, emoticons, and any redundant data that is not useful for analysis. The MapReduce output should be written to HDFS in the same layout as the attached image named "Output", but with clean data.
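The text cleaning described above could be sketched roughly as follows. This is only an illustration under assumptions: the class name TweetCleaner, the method clean, and the exact patterns are hypothetical, and the real dataset may need different rules. It uses plain Java so it can be called from a Mapper's map() method; the Hadoop wiring is left out.

```java
import java.util.regex.Pattern;

// Hypothetical helper (not part of the posting): strips hashtags and
// emoticons from one tweet's text. Intended to be called inside map().
public class TweetCleaner {
    // A '#' followed by letters, digits, or underscores.
    private static final Pattern HASHTAG = Pattern.compile("#[\\p{L}\\p{N}_]+");
    // A few common ASCII emoticons such as :) ;-( =D, only when they
    // stand alone between whitespace (assumed subset, not exhaustive).
    private static final Pattern ASCII_EMOTICON =
            Pattern.compile("(?<!\\S)[:;=][-']?[)(DPp](?!\\S)");
    // Unicode "other symbol" characters (covers most emoji), plus the
    // variation selector U+FE0F and zero-width joiner U+200D.
    private static final Pattern EMOJI =
            Pattern.compile("[\\p{So}\\x{FE0F}\\x{200D}]");
    private static final Pattern WHITESPACE = Pattern.compile("\\s+");

    public static String clean(String text) {
        if (text == null) {
            return "";
        }
        String s = HASHTAG.matcher(text).replaceAll(" ");
        s = ASCII_EMOTICON.matcher(s).replaceAll(" ");
        s = EMOJI.matcher(s).replaceAll(" ");
        // Collapse the gaps left by the removals.
        return WHITESPACE.matcher(s).replaceAll(" ").trim();
    }
}
```

In a real job, the mapper would first extract the tweet text from each JSON record with a JSON library, pass it through a method like clean, and emit the result so the output lands in the HDFS output directory.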
Tasks:
Dataset: [login to view URL]
Programming: MapReduce with Java
Data profiling: Write MapReduce Java code to characterize (profile) the data in each column.
Data cleaning: Clean the tweets by removing hashtags, emoticons, and any redundant data that is not useful for analysis. Write MapReduce Java code to ETL (extract, transform, load) the data source: drop unimportant columns, normalize the data in a column, and detect badly formatted rows.
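As a rough sketch of the profiling and normalization tasks above, the reduce-side logic for one column might look like the following. Everything here is an assumption for illustration: the class name ColumnProfile, the chosen statistics (total, missing, min and max length), and the trim-and-lowercase normalization are hypothetical, since the dataset's columns are not shown in the posting. A mapper would emit (column name, value) pairs and a reducer would fold each column's values into one such profile.

```java
import java.util.Locale;

// Hypothetical accumulator (not part of the posting): summarizes the
// values of a single column. A reducer could call add() per value.
public class ColumnProfile {
    private long total;
    private long missing;
    private int minLength = Integer.MAX_VALUE;
    private int maxLength = 0;

    // Assumed normalization rule: trim and lowercase; null becomes "".
    public static String normalize(String raw) {
        return raw == null ? "" : raw.trim().toLowerCase(Locale.ROOT);
    }

    // Fold one raw value into the running statistics.
    public void add(String raw) {
        total++;
        String value = normalize(raw);
        if (value.isEmpty()) {
            missing++; // null, empty, or whitespace-only counts as missing
            return;
        }
        minLength = Math.min(minLength, value.length());
        maxLength = Math.max(maxLength, value.length());
    }

    public long getTotal() { return total; }

    public long getMissing() { return missing; }

    // Reports 0 when no non-missing value has been seen.
    public int getMinLength() { return missing == total ? 0 : minLength; }

    public int getMaxLength() { return maxLength; }

    @Override
    public String toString() {
        return "total=" + total + "\tmissing=" + missing
                + "\tminLen=" + getMinLength() + "\tmaxLen=" + maxLength;
    }
}
```

A record whose JSON fails to parse, or whose required fields are all missing, could be counted as a badly formatted row and skipped in the mapper before any value reaches this accumulator.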
Hi, I've read your description carefully.
I have extensive experience with Java.
I've also worked on several similar projects.
So I can complete your project to a high standard and on time.
I look forward to hearing more about the project from you via chat.
Thanks & Best regards!