data.ai

Big Data Engineer, Data Modeling

data.ai

Remote job description

data.ai is the mobile standard and the trusted source for the digital economy. Our vision is to be the first Unified Data AI company that combines consumer and market data to provide insights powered by artificial intelligence. We passionately serve enterprise clients to create winning digital experiences for their customers.

We care deeply about our high-performance culture and operate as a global team. We have set excellence as our standard, hold each other accountable, continuously push innovation and win with style.

What can you tell your friends when they ask you what you do?

We're looking for an experienced Big Data Engineer who can create innovative new products in the analytics and data space. You will participate in the development that creates the world's #1 mobile app analytics service. Together with the team, you will build out new product features and applications using agile methodologies and open-source technologies. You will work directly with Data Scientists, Data Engineers, Product Managers, and Software Architects, and will be on the front lines of coding new and exciting analytics and data mining products. You should be passionate about what you do and excited to join an entrepreneurial start-up.

You will be responsible for and take pride in....

As a Big Data Engineer, we will need you to be in charge of model implementation and maintenance, and to build a clean, robust, and maintainable data processing program that can support these projects on huge amounts of data, this includes
  • Able to design and implement complex data product components based on requirements with possible technical solutions.
  • Write data programs using Python (e.g., pyspark) with a commitment to maintaining high-quality work while being confident in dealing with data mining challenges.
  • Discover any feasible new technologies lying in the Big Data ecosystem, for example, the Hadoop ecosystem, and share them with to team with your professional perspectives.
  • Get up to speed in the data science and machine learning domain, implementing analysis components in a distributed computing environment (e.g., MapReduce implementation) with instruction from Data Scientists.
  • Be comfortable conducting detailed discussions with Data Scientists regarding specific questions related to specific data models.
  • You should be a strong problem solver with proven experience in big data.
You should recognize yourself in the following...
  • Hands-on experience and deep knowledge of the Hadoop ecosystem.
  • Must: PySpark, MapReduce, HDFS.
  • Plus: Storm, Kafka.
  • Must have 2+ years of Linux environment development experience.
  • Proficient with programming in Python & Scala, experience in Pandas, Sklearn or Other data science and data analysis toolset is a big plus.
  • Experience in data pipeline design & automation.
  • Having a background in data mining, analytics & data science components implementation, and machine learning domain, familiarity with common algorithms and libs is a plus.
  • Passion for cloud computing (AWS in particular) and distributed systems.
  • You must be a great problem solver with the ability to dive deeply into complex problems and emerge with clear and pragmatic solutions.
  • Good communication, and cooperation globally.
  • Major in Math or Computer Science.
This position is located in Hyderabad, India.

data.ai are in the process of establishing an entity in India, in the interim the employees will be on the rolls of our Global Employer of Record, Innova Solutions


Show more Show less


Summary
Company name: data.ai
Remote job title: Big Data Engineer, Data Modeling
Job tags: Distributed Systems, Hadoop, analytics
  • location or timezone

  • category

    Data
  • posted

    401 days ago

Share or copy

Job alerts