Hadoop sales require strong programming skills in Java, Python and Scala. The candidate should also have strong verbal and communication skills dealing with customers and partners. It requires a good understanding of analysis, design, coding and testing. Crafting enterprise data solutions for large organizations will be part of the skills required.



De 16405 comentarios, los clientes califican nuestro Hadoop Consultants 4.74 de un total de 5 estrellas.
Más información

Filtro

Mis búsquedas recientes
Filtrar por:
Presupuesto
a
a
a
Tipo
Habilidades
Idiomas
    Estado del trabajo
    7 trabajados encontrados, precios en USD

    Total Experience : 4+ years to 7 years Designation : Sr. Data Engineer Mandatory skills : Pyspark & EMR Location : Pune /Remote Job Description - 1) Hands-on experience with Python, Spark, EMR 2) Proficient understanding of distributed computing principles 3) Proficiency with Data Processing: HDFS, Hive, Spark, Scala/Python 4) Independent thinker, willing to engage, challenge and learn new technologies. 5) Understanding of the benefits of data warehousing, data architecture, data quality processes, data warehousing design, and implementation, 6) Table structure, fact and dimension tables, logical and physical database design, data modeling, reporting process metadata, and ETL processes. Requirements -- 1) Client-facing skills: Solid experience working with clients directly, to b...

    $1377 (Avg Bid)
    $1377 Oferta promedio
    3 ofertas

    Adventure works data movement from on Prem SQL server to S3 or snowflake using Apache nifi, talend or other etl tool Kafka etc

    $122 (Avg Bid)
    $122 Oferta promedio
    1 ofertas

    Spark Use Case (Movie Review Analysis) IMBD is an online database of movie-related information. IMBD users rate the movies and provide reviews. They rate the movies on a scale of 1 to 5; 1 being the worst and 5 being the best. The dataset also has additional Information, such as the release year of the movie. You have to analyze the data collected and answer the following questions. You need to find: 1) The total number of movies 2) The maximum rating of movies 3) The number of movies that have maximum rating 4) The movies with ratings 1 and 2 5) The list of years and number of movies released each year 6) The number of movies that have a runtime of two hours Steps to follow: 1. Create a table in RDBMS (MySql, MSsql, Oracle) and load the data in table (usign bulk inser...

    $172 (Avg Bid)
    $172 Oferta promedio
    9 ofertas

    Hi there! I’m looking for a software engineer with python and spark proficiency. Specifically, we need that two functions of an ipython jupyter notebook file (.ipynb) will be converted, or re-written, into spark code. The former script use mainly “pandas” and “numpy” libraries for data treatment, we need the same function using spark data treatment functions. Deliverables: • Jupyter notebook file with the new two functions using spark compatible code • List of libraries or additional packages if they would be needed I will give you: • Original notebook file • Requirements file (Spark 3.3.1) Feel free to ask for more information.

    $26 (Avg Bid)
    $26 Oferta promedio
    5 ofertas

    ※ Please, see the attached, and offer your price quote with questions [Price and time is negotiable] ※ Will need your help from end of Dec ~ Jan, 2023 1) Manual : Creating development and installation manual for overall service implementation guideline using HDFS – Impala API >All details must be provided : command/option/setting file/Config etc. > We will use your manual to create our own HDFS used solution >Additional two to four weeks of take-over time [We can ask some questions when the process does not work under the manual process] 2. Consulting : Providing solutions for the heavy load section(date inter delay) when data is insert through HDFS >Data should be processed in 3 minutes, but sometimes it takes more time > Solutions for how we can remove or de...

    $999 (Avg Bid)
    $999 Oferta promedio
    9 ofertas

    Cloudera Expert needed to add 6 nodes to basic CDH environment running cloudera 5.11. Platform runs very well but need more disk space and more nodes for compute resource distribution.

    $152 (Avg Bid)
    $152 Oferta promedio
    5 ofertas

    Hi, I need to stream Elastic search data to Spark Data frame as structured streaming in real time. Need to write this application in java.

    $420 (Avg Bid)
    $420 Oferta promedio
    4 ofertas

    Principales artículos de la comunidad Hadoop