Job Seekers, Please send resumes to resumes@hireitpeople.com
Technical Hiring Criteria (Must Haves):
- Azure, Kubernetes, CI/CD
- Programming Languages: C#.Net
- Platform/Environment: Microsoft Azure
- Database Management System: SQL Azure
- Application Packages, etc.:
Detailed Job Description:
- 5+ years of experience in the cloud SRE/Infrastructure, or any related fields·
- 5+ years configuring and managing cloud infrastructure (AWS, GCP, Azure)
- 2+ years working with cloud-agnostic configuration management frameworks (Ansible, Terraform, etc.)
- 2+ years of experience with queueing systems such as Kafka, RabbitMQ, SQS, etc
- 2+ years working with containerization technologies (Docker, Kubernetes, etc)
- 1+ years managing System Observability experience (Zabbix, CloudWatch, PagerDuty, Datadog, and Azure Monitor, SignalFx, Graphana, etc)
- Understanding of SSH, VPN, TCP/IP, DNS, HTTP(S), network routing and subnet
- Experience with an always-on and high-volume web server stack (Nginx, HAProxy, squid, etc)
- Experience with Azure PaaS, Azure networking, and Azure Site Reliability solutions
- Experience with AWS products including EC2, EBS, ELB, IAM, S3, Route 53, VPCs, Gateways, Lambda, etc.
- Experience with Azure DevOps services such as DevOps, Pipelines, Test Plans, Artifacts, etc
- Experience with CI/CD pipelines using tools such as Jenkins, Travis, Azure DevOps, TeamCity, etc
- Knowledge of Linux architecture, security, administration, performance monitoring/tuning, troubleshooting, and production operations
- Fluent in Python and Shell Scripting, with experience implementing automation and monitoring using shell scripting and other related tools
Job Responsibilities:
- Design and build infrastructure & systems that provide high levels of scalability, reliability, and performance, while balancing security, maintainability, reliability and operational excellence
- Interface across teams to codify and reliably test infrastructure changes using PromoteIQ s software development lifecycle
- Partner with Product and Dev teams to provide guidance and best practices around scalability, reliability, and performance of our productions systems, infrastructure, and software
- Work as a team on escalations, resolving critical issues that impact our highly available dev, test, and production systems
- Work with a creative engineering team to continuously implement and improve reliable and speedy build environments for DEV & QA; provide timely build status updates; automate as much as possible to improve efficiency and quality
- Promote innovation, implementation of cutting-edge technologies, outside-of-the-box thinking, teamwork, and self-organization
- Work with Github Actions or other build tools in a CI/CD process to build and deploy to our cloud-agnostic environment
- Ensure traceability, observability, and retrievability of system behavior
- Build logging, monitoring, and alerting systems to identify bottlenecks and assist with debugging, analysis, and optimization in a cloud-agnostic environment