Client is seeking a senior AWS technologist with extensive experience in AWS Cloud. This role will focus on researching and identifying AWS service resiliency characteristics and failure modes, designing experimentation to validate its resiliency behavior, surfacing any potential weakness/issues, working with AWS engineers in clarifying and engineering to resolve those weakness/issues, and working with clients engineers to solution for the required level of resiliency. The role will require that the individual be familiar with current and emerging cloud technologies, tools and system architectures. The role requires an individual who can work closely with application developers, architects and infrastructure engineers to deliver resilient solutions in a team-oriented environment. The expectation is that this candidate will work with other principal technologists in the organization to develop a resiliency model for supporting our applications’ business and technical requirements as we establish a significant presence within the Cloud landscape.
This role is considered part of the Business Unit Senior Leadership team and may mentor delivery team members.
Responsibilities:
- Lead and implement analytical data lakes and accepted best practices in AWS.
- Intimate knowledge of all phases of AWS infrastructure and concepts such as VPC, subnets, security groups, S3 object stores, RDS, EC2, Glacier, Lambda, IAM, enterprise security, data security, encryption, DevOps, replication and disaster recovery.
- Deep experience with security in cloud environments around HIPAA, PHI/PII data, data encryption at rest and in transit as well as security concepts like tokens, federated security models and secrets management.
- Support development of a highly scalable and extensible Big Data platform on cloud which enables collection, storage, modeling, and analysis of massive data sets from numerous channels with an emphasis on proper security and data governance best practices.
- Familiarity with DevOps and CI/CD as well as Agile tools and processes including Git, Jenkins, Jira and Confluence.
- Provide support to Data Engineering teams on deployment of Hadoop/Spark jobs, work with IT Operations and Information Security teams on monitoring and troubleshooting of incidents to maintain service levels and coordinate with Vendor teams on installation, bug fixes, upgrades and escalations.
- Benchmark systems and analyze system bottlenecks in a hybrid environment and propose solutions to eliminate them.
- Research and provide recommendations on automating administration tasks and contribute to the evolving systems architecture to meet changing requirements for scaling, reliability, performance and manageability.
- Provide mentoring, knowledge transfer and assist in training for other team members.
- Understand business goals and drivers and translate those into an appropriate technical solution.
- Identify explicit/implicit technical assumptions and devise corresponding experimentation to validate the key resiliency assumptions.
- Participate in the definition of high availability and resilience standards and best practices for services from AWS and other internal/external providers.
- Participate in the assessment of service readiness for tiered business applications and service adoption.
- Participate in the establishment of appropriate monitoring and alerting of service events related to performance, scalability, availability, and reliability.
- Contribute to cloud strategy discussions and decisions on overall Cloud design and best approach for implementing cloud solutions.
- Focus on continuous improvement practices as required to meet system resiliency imperatives.
- Act as a liaison with other architects (security, infrastructure, data, etc.) and with delivery teams working primarily within an Agile (Scrum) methodology.
- Establish relationships with IT leaders, architects, and technical specialists in advancing proposed resiliency solutions.
- Support for the adoption of DevOps methodology and Agile project management.
- Communicate complicated technical concepts effectively to a broad group of stakeholders.
- Engage with Technical Architects and technical staff to determine the most appropriate technical strategy and designs to meet business needs.
- Demonstrate broad solutions technical leadership, impacting significant technical direction, exerting influence outside of the immediate team and driving change.
Qualifications:
- Extensive experience in administering Hadoop and Spark cluster environments with Cloudera’s distribution on-premise is required as well as experience in administering a hybrid environment.
- Intimate knowledge of all phases of AWS infrastructure and concepts such as VPC, subnets, security groups, S3, RDS, EC2, Glacier, Lambda, IAM, security, encryption, DevOps, replication and disaster recovery.
- Knowledge of analytical workspaces such as Jupyter, Zeppelin, RStudio as well as data science analytical methodologies.
- Familiarity with DevOps and CI/CD as well as Agile tools and processes including Git, Jenkins, Jira and Confluence.
- Ability to elicit requirements and communicate clearly with non-technical individuals, development teams, and other ancillary project members.
- Proficient in authoring, editing and presenting technical documents.
- Strong written and oral communication skills; Ability to communicate effectively with technical and non-technical staff.
- Desire to mentor younger team members and develop their skills.
- Experience working on multiple concurrent projects.
- Excellent problem-solving skills.
- Be independent and self-driven.
- Bachelor’s degree in Computer Science or related field.
- Must be open to travel.
Preferred skills and education:
- Master’s degree in Computer Science or related field.
- Certification in AWS preferred.
- Ability to work independently, with minimal supervision.
- Strong knowledge of AWS cloud environment.
- Knowledge of AWS cloud monitoring tools (CloudWatch, CloudTrail, Splunk, and other application monitoring).
- Knowledge of AWS Identity Access Management (IAM).
- In-depth, hands-on expertise in Java, MySQL, Linux.
- Ability to estimate the financial impact of various solution architecture alternatives.
- Must be comfortable working in an open, highly collaborative team.
- Strong troubleshooting skills.
- Ability to write scripts (Bash, PHP, Python) for automation of solution resiliency validation and verification.
- Excellent oral and written communication skills along with the ability to communicate at all levels.
- Experience with provisioning and configuration tools like AWS CloudFormation/Terraform a plus.
#J-18808-Ljbffr