Data Scientist

Leidos Washington Navy Yard, Washington DC
October 1, 2022
Full-time

Description

Job Description:

Leidos is looking for a skilled data scientist with experience in data pipelining, cloud, and Apache Spark. This position is located in McLean and requires an active TS/SCI with Polygraph.

The Sponsor's organization provides technical solutions and capabilities that enable a cadre of analysts to make critical assessments, including a platform for enterprise search, digital forensics, and data analytics. The Sponsor's organization is building a modern cloud-based system to replace a legacy standalone system and requires an infrastructure team that can work in a fast-paced, dynamic, agile software development environment. The Sponsor adheres to Agile Scrum development methodology best practices and has two-week sprint cycles. The Contractor team shall work with a variety of individuals, including key stakeholders and other development teams; however, the Sponsor project manager will manage priorities. The Contractor team will document and maintain code and workflows using version control systems/code repositories, task management tools, open source-style contribution models, and issue tracking. The team will also manage the operating system lifecycle (including operating system upgrades, updates, patches, and configuration changes) and perform other duties requiring in-depth knowledge of server hardware and software technologies. The team is also responsible for setting user permissions (including roles) and troubleshooting permission issues.

Mandatory Skills:

  • Requires a BS degree and 12-15 years of prior relevant experience, or a Master's degree with 10-13 years of prior relevant experience. May possess a Doctorate in a technical domain.

  • Demonstrated experience working with an AWS cloud environment.

  • Demonstrated experience understanding and implementing system security requirements.

  • Demonstrated experience creating a testing environment to identify and fix bugs and improve efficiency.

  • Demonstrated experience programming in Python, PowerShell, and Java 8+.

  • Demonstrated experience using Python and the PySpark library to read, write, and manipulate large structured and semi-structured datasets.

  • Demonstrated experience using task tracking and version control technology to include JIRA and GitHub.

  • Demonstrated experience using SQL and database technologies such as MySQL or SQL Server.

  • Demonstrated experience using and managing Elasticsearch clusters.

  • Demonstrated experience tuning and optimizing Elasticsearch.

  • Demonstrated experience creating and managing Elasticsearch indices.

  • Demonstrated experience with Apache Spark and managing Spark clusters.

  • Demonstrated experience tuning and optimizing Apache Spark.

  • Demonstrated experience with the Databricks platform and Databricks API & CLI.

  • Demonstrated experience understanding Hadoop Distributed File System (HDFS).

  • Demonstrated experience understanding Databricks File System (DBFS).

  • Demonstrated experience with infrastructure as code (IaC) technologies, including AWS CloudFormation.

  • Demonstrated experience using automated build tools such as Jenkins.

  • Demonstrated experience implementing machine learning models on text and multimedia data, such as Spark NLP and computer vision models.

  • Demonstrated experience with Linux operating systems and shell scripting such as Bash.

  • Demonstrated experience developing machine learning models on text data and structured datasets.

Desired:

  • Bachelor's degree in Engineering, Computer Science, Mathematics, Data Science, Statistics, or related field or equivalent work experience.

  • Demonstrated professional experience with cloud technology networks, digital forensics, or cybersecurity topics.

  • Demonstrated experience using artificial intelligence and machine learning technologies/packages.

  • Demonstrated experience using data pipelines and workflow technologies.

Pay Range:

Pay Range $113,100.00 - $174,000.00 - $234,900.00

The Leidos pay range for this job level is a general guideline only and not a guarantee of compensation or salary. Additional factors considered in extending an offer include (but are not limited to) responsibilities of the job, education, experience, knowledge, skills, and abilities, as well as internal equity, alignment with market data, applicable bargaining agreement (if any), or other law.

