Job Description
Systems Engineer
McLean, VA
The Sponsor supports analysts by providing large datasets, methodologies, and data visualizations to address pressing intelligence questions. The Sponsor requires support in developing and maintaining a cloud-based data environment to transport and store data, perform extract, transform, and load (ETL) operations, and disseminate data solutions. The Sponsor needs experienced support in data engineering, cloud architecture, and application development. The work includes regular engagement with data scientists, analysts, and managers.
The Data Engineer will assist with strategic planning and oversee implementation of the Sponsor's cloud-based data environment, including the mapping of data sources and access controls. They will develop code, data models, and documentation to Sponsor standards; provide systems administration and programming support for ETL processes and data infrastructure efforts; and train team members and conduct knowledge transfer on issues and technologies related to the Sponsor's ETL process, on-premises high-capacity compute cluster, and administrative duties. The Data Engineer will coordinate with external data and platform providers to ensure the smooth functioning of the Sponsor's systems and data flows and to accomplish any needed changes, and will coordinate with experts on the technical aspects of acquiring new datasets or data management technologies for inclusion in the Sponsor's environment. They will also support the cross-domain transfer and integration of data.
Requirements
Technologies/Tools
Mandatory Skills:
- Demonstrated experience serving as a technical liaison between system engineers, data engineers, data scientists, analysts, and non-technical managers and personnel.
- Demonstrated experience with AWS cloud services, including long-term storage options, and cloud-based database services such as Databricks or Elastic MapReduce (EMR).
- Demonstrated experience with SQL database structures and mapping between SQL databases.
- Demonstrated experience in large-scale data migration efforts.
- Demonstrated experience with database architecture, performance design methodologies, and system-tuning recommendations. Familiarity with Glue, Hive, and Iceberg, or similar tools, is preferred.
- Demonstrated experience with Python, Bash, and Terraform.
- Demonstrated experience with DevSecOps solutions and tools.
- Demonstrated experience implementing CI/CD pipelines using industry-standard processes.
Desired Skills:
- Demonstrated experience with the Sponsor's data environment and on-premises compute structure.
- Demonstrated experience with data quality and data governance concepts.
- Demonstrated experience maintaining, supporting, and improving the ETL process through the implementation and standardization of data flows with Apache NiFi and other ETL tools.
- Demonstrated experience with Apache Spark.
Benefits
Vacation - 5 weeks of accrued paid vacation per year (i.e., 8.33 hours accrued per pay period worked).
Holidays - Paid holidays published annually by the Office of Personnel Management, excluding Inauguration Day.
Health Benefits - 100% employer-paid health benefits (United Healthcare, Guardian Dental, VSP Vision, MetLife Life and Disability Insurance, and an annual $1,500 employer HSA contribution on qualified plans). Coverage begins on the 1st of the month following your start date.
401k Contribution - 6% employer contribution (3% is paid out each pay period; the additional 3% is paid out as a lump sum in Q1 each year).
Training Reimbursement - Approved training and education expenses will be reimbursed.
Travel Expenses - Approved travel expenses will be reimbursed.
Note: From time to time, the company may change employee benefits.
Qualification
Bachelor's Degree
Key Skills Required
- SQL
- Architecture
- Python
- AWS
- Apache
- Apache NiFi
- Data Engineering
- Data Governance
- CI/CD
- Apache Spark
- Application Development
- Bash
- Data Infrastructure
- Data Management
- Data Migration
- Data Quality
- Database
- Database Architecture
- Design
- Development
- DevSecOps
- Disability Insurance
- Documentation
- Governance
- Healthcare
- Implementation
- Infrastructure
- Insurance
- Integration
- Intelligence
- Knowledge Transfer
- Management
- MapReduce
- Provision
- Reimbursement
- Standardization
- Strategic Planning
- Technical Aspects
- Terraform
- Training