Site Reliability Engineer (SRE) - Mandarin Speaking
Location: Chicago-HQ/Hybrid
Chowbus is a SaaS (Software as a Service) company that began as an online platform for food ordering, payment, and delivery. The company has since shifted its focus to providing an all-in-one POS (point-of-sale) system tailored to the evolving needs of the restaurant industry. Headquartered in Chicago, Illinois, Chowbus serves over 2,000 restaurants with partners across 20 major U.S. cities. Our mission is to build the most comprehensive ecosystem to empower restaurants.
We are seeking a highly motivated Site Reliability Engineer (SRE) with 2-3 years of hands-on experience in managing and scaling infrastructure. The ideal candidate will have a strong background in Computer Science or Electrical Engineering and be fluent in both English and Mandarin. This role provides an exciting opportunity to work on cutting-edge infrastructure projects, cloud-based solutions, and automation while contributing to the long-term strategy of IT management.
Responsibilities
- Infrastructure Management: Maintain and optimize cloud-based infrastructure to ensure high availability, reliability, and scalability of services.
- Site Reliability Engineering (SRE): Implement SRE best practices, focusing on monitoring, automation, and system performance to improve the reliability and efficiency of the infrastructure. Proactively monitor system health, respond to incidents, troubleshoot issues, and implement solutions that prevent recurrence.
- Automation: Automate operational tasks such as provisioning, configuration, and monitoring using Infrastructure as Code (IaC) tools like Terraform and Ansible.
- Collaboration: Work closely with development teams to ensure seamless integration of new software and infrastructure solutions.
- IT Management: Build highly automated IT management processes and integrate AI to automate routine tasks.
Qualifications
- Education: Bachelor's or Master's degree in Computer Science, Electrical Engineering, or a related field.
- Experience: 2-3 years of experience in Infrastructure, SRE, or DevOps roles, with hands-on experience in cloud environments (AWS, GCP, or Azure), containers (Docker, Kubernetes), and automation tools.
- Languages: Fluent in both English and Mandarin (written and spoken) for effective communication with global teams.
Technical Skills:
- Proficient in Linux/Unix systems.
- Experience with cloud platforms (AWS, Google Cloud, Azure).
- Familiarity with Infrastructure as Code (Terraform, Ansible, etc.).
- Experience with CI/CD tools (Jenkins, GitHub Actions, CircleCI, etc.).
- Strong knowledge of scripting languages (Python, Bash, etc.).
- Experience with monitoring and logging tools (Prometheus, Grafana, ELK Stack).
- Understanding of networking, security, and system performance.
Preferred Qualifications:
- Experience with AWS ECS and other container orchestration tools.
- Exposure to IT management principles or a strong interest in progressing into IT leadership roles.
- Knowledge of cybersecurity best practices and compliance regulations.
- Familiarity with Agile and DevOps methodologies.
Soft Skills:
- Strong problem-solving skills and ability to troubleshoot complex issues.
- Excellent communication and teamwork skills to collaborate effectively with cross-functional teams.
- Self-motivated with a passion for learning new technologies and optimizing systems.
- High level of responsibility and accountability.
What We Offer:
- Medical, dental, and vision insurance.
- 401(k).
- 100% employer-paid Short-Term Disability (STD).
- 100% employer-paid Life Insurance and option for additional employee-paid Life Insurance.
- 100% employer-paid Accidental Death and Dismemberment (AD&D) Insurance and option for additional employee-paid AD&D Insurance.