JobsEQ by Chmura Logo

Data Engineer, 6+ Yrs’ of Exp, US Citizen

SoKat

Location: Baltimore, Maryland
Type: Non-Remote, Full-Time
Posted on: May 26, 2026
Data Engineer, 6+ Yrs’ of Exp, US Citizen
Company Description SoKat builds AI software that addresses the federal government's most complex data challenges, transforming large-scale datasets into operational intelligence. The team, rooted in Johns Hopkins University faculty and alumni, delivers custom machine learning models, generative AI applications, and enterprise analytics platforms that meet stringent federal security and compliance requirements. SoKat has won multiple competitive federal contracts by delivering production-ready AI solutions that are scalable, reliable, and trusted by end users. The company’s expertise spans generative AI, predictive modeling, computer vision, natural language processing, and autonomous decision systems, with a strong emphasis on explainability and human-centric design.
Role: The SoKat team is looking for a Data Engineer to help optimize Veteran VA customer experience through data, analytics, advanced analytics, and AI tools and techniques in direct support of the Veterans Experience Office.
Position Responsibilities:
• Develop data engineering pipelines to support data ingestion, data conditioning/cleansing, data integrations to enable data product development.
• Configurate data analytics processes to load data from various VA systems into Customer Experience Insights (CXI) data warehouse.
• Interface with data scientists, data analysts, data stewards to understand the functional and technical data management requirements and develop technical specifications to illustrate/capture the requirements.
• Support ad-hoc data analysis requests by quickly understanding requirements and mapping to data attributes. Deliver ad-hoc data reports based on client reporting specifications.
• Develop automated scripts for data extraction, conditioning, and transformation of large data sets into actionable insights for informed client decision-making – and maintain developed models.
• Augment data analytic product capabilities by studying information needs specific to the client; conferring with users; and following a hybrid agile software development lifecycle.
Required Qualifications:
• 6+ years of experience in Python/Spark programming for implementing data operations such as connectivity, data manipulation, data integration and publication.
• 6+ years of experience developing ETL pipelines and troubleshooting data load issues.
• 4+ years of experience in SQL or related query languages.
• 2+ years of experience in Databricks for data engineering, systems integration, or machine learning projects.
• 4+ years’ experience in data warehousing projects and understanding of implementation of dimensional modelling concepts such as fact and dimension tables.
• 2+ years of experience working with GitHub for code management and deployment.
• 2+ years of experience working with Agile development frameworks.
• Ability to quickly adapt and excel in a fast-paced environment.
• Experience working with Microsoft Word, PowerPoint, Excel, and Visio.
• Strong organizational and collaboration skills.
• Strong written and verbal communication skills to support business writing, reporting, team and client collaboration, and professional correspondence.
• Must hold a minimum of a Bachelor’s degree.
• Must be a US citizen.
• Must successfully complete background check.
IT Services and IT Consulting
Information Technology
Full-time