Hartelijk dank voor alle reacties, deze aanvraag is ingevuld en dus bij deze gesloten.
Met vriendelijke groet,
Thijs van Gulik
For one of our clients in Brussels we are currently seeking a Data Engineer
Identify the most appropriate data sources to use for a given purpose and understand their structures and contents
Extract structured and unstructured data from the source systems (relational databases, data warehouses, document repositories, file systems, ), prepare such data (cleanse, re-structure, aggregate, ) and load them onto Hadoop.
Actively support data scientists in the data exploration and data preparation phases. Where data quality issues are detected, liaise with the data supplier to do root cause analysis
Where a use case is meant to become a production application, contribute to the design, build and launch activities
Ensure the maintenance and support of production applications (watch duty)
Liaise with teams to address infrastructure issues and to ensure that the components and software used of the platform are all consistent
Where the skills allow for it, perform advanced data analysis on a selection of business use cases, supported by data scientists
MANDATORY: min 90 % work time, Hadoop knowledge required, Experience in ETL.
Experience with understanding and creating data flows, with data architecture, with ETL/ELT development (MS SQL Server SSIS, Datastage, ) and with processing structured and unstructured data
Ability to write performant SQL statements
Understanding of the Hadoop ecosystem including Hadoop file formats like Parquet and ORC
Experience with open source technologies used in Big Data analytics like Spark, Pig, Hive, HBase, Kafka,
Ability to write MapReduce & Spark jobs
Knowledge of Cloudera
Ability to analyze data, to identify issues like gaps and inconsistencies and to do root cause analysis
Knowledge of Java
Experience with Linux Redhat and Linux Scripting
Experience delivering scripts
Experience in working with customers to identify and clarify requirements
Ability to design solutions that are fit for purpose whilst keeping options open for future needs
Strong verbal and written communication skills, good customer relationship skills
Knowledge of R, Python and Scala
Knowledge of IBM Mainframe
Knowledge of or experience in classic and new/emerging Business Intelligence methodologies
Reactie is prive en alleen zichtbaar voor de opdrachtgever en de plaatser van de reactie.
Je moet inloggen voordat je een reactie kunt plaatsen.