Ability to build an effective data architecture, streamline data processing, and maintain large-scale data systems. Able to work with Python, Shell and SQL. to create data engineering pipelines, automate common file system tasks, and build a high-performance database.
Experienced with cloud and big data tools such as AWS Boto, PySpark and MongoDB, to help create and query databases, wrangle data, and configure schedules to run pipelines.