Data Engineer

October 2, 2023

Apply for this job

Email *

Job Description

<p>For our client that is a Fin-Tech company based in Israel</p><p><br></p><p><u>Requirements:</u></p><p>●3+ years of hands-on experience with SQL.</p><p>●&nbsp;Experience with Spark scripting languages: PySpark/Scala/Java/R</p><p>●&nbsp;Hands-on experience with data transformation, validations, cleansing, and ML feature engineering</p><p>●&nbsp;Hands-on experience working with Apache Spark cluster – an advantage.</p><p>●&nbsp;BSc degree or higher in Computer Science, Statistics, Informatics, Information Systems, Engineering, or another quantitative field.</p><p>●&nbsp;Experience working with and optimizing big data pipelines, architectures, and data sets – an advantage.</p><p>●&nbsp;Strong analytic skills related to working with structured and semi-structured datasets.</p><p>●&nbsp;Build processes supporting data transformation, data structures, metadata, dependency, and workload management.</p><p>●&nbsp;&nbsp;Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.</p><p>●&nbsp;Business-oriented and able to work with external customers and cross-functional teams.</p><p>●&nbsp;Fluent in English, both written and spoken</p><p>&nbsp;&nbsp;</p><p><u>Nice to have:</u></p><p>●&nbsp;Experience with Linux</p><p>●&nbsp;Experience in building Machine Learning pipeline</p><p>●&nbsp;Experience with Elasticsearch</p><p>●&nbsp;Experience with Zeppelin/Jupyter</p><p>●&nbsp;Experience with workflow automation platforms such as Jenkins or Apache Airflow</p><p>●&nbsp;Experience with Microservices architecture components, including Docker and Kubernetes.</p><p><br></p><p><u>Key Responsibilities:</u></p><p>● Implement and maintain data pipeline flows in production within the ThetaRay system based on the data scientist’s design</p><p>● Design and implement solution-based data flows for specific use cases, enabling the applicability of implementations within the ThetaRay product</p><p>● Building a Machine Learning data pipeline</p><p>● Create data tools for analytics and data scientist team members that assist them in building and optimizing our product into an innovative industry leader</p><p>● Work with product, R&amp;D, data, and analytics experts to strive for greater functionality in our systems</p><p>● Train customer data scientists and engineers to maintain and amend data pipelines within the product</p><p>● Travel to customer locations both domestically and abroad</p><p>● Build and manage technical relationships with customers and partners</p><p>•&nbsp;&nbsp;&nbsp;Travel to customer locations abroad</p>