Responsibilities include, but are not limited to:
Maintains and extends existing data centralization and automation software.
a) Possesses proven work experience or formal training as a Data Engineer, Machine Learning Engineer, or similar role.
b) Provides expertise in Python software development.
· Uses Python to read, create, and modify relational and graph databases.
· Uses Python to provide REST API services for databases.
· Uses Git for software version control.
· Is familiar with Agile development methodologies.
· Is familiar with the full software development life cycle (SDLC).
· Complies with industry standards and best practices.
c) Maintains and extends existing automation software.
· Provides expertise in Apache Airflow.
· Uses Apache Airflow to design and implement data pipelines for extract, transform, and load operations.
· Uses Apache Airflow to design and implement pipelines for task automation.
· Uses Python and Apache Airflow to develop production machine learning pipelines from existing prototype models.
· Works with data scientists and algorithm developers to design and implement software applications pertaining to data and ML pipelines.
d) Provides expertise in Linux OS and Docker.
· Uses Linux OS as primary development environment.
· Uses Docker to containerize existing software applications.
e) Maintains and extends existing data infrastructure.
· Provides expertise in databases and database design, including SQL, NoSQL, and ORMs.
· Provides expertise in developing data models for various data modalities.
f) Maintains and extends software documentation according to best practices.
· Creates technical documentation, flow charts, and specifications for reference, reporting, and security.
Collaborates with MSIC personnel and shares leading practices and lessons learned in formal training events.
Collaborates with other contractor teams supporting each division across MSIC, which will operate as domain-specific mission owners. These contractor teams will provide the necessary domain expertise as it pertains to MSIC's S&TI mission. Additionally, these contractor teams develop and maintain domain-specific data processing and analytic tools that will be integrated into automated workflows and applications developed under this effort.
Desired experience:
· Python development
· Agile methodologies
· Data pipeline design and implementation
· Tools: Apache Airflow, Linux OS, Docker, SQL, NoSQL, ORMs, graph databases, Git
Security Clearance Requirements:
· Must have an active Top Secret Security Clearance and the ability to obtain TS/SCI with CI Polygraph.
Education Requirements:
· Bachelor’s degree
Other Requirements:
· Must be a US citizen and be able to obtain and hold an active Top Secret Security Clearance with SCI and CI Polygraph.
· 2+ years’ experience in data engineering, data science, software engineering, or machine learning engineering