Who We Are
Artmac Soft is a technology consulting and service-oriented IT company dedicated to providing innovative technology solutions and services to customers.
Job Description
Job Title: Senior Cloud Infrastructure Engineer
Job Type: W2
Experience: 5-10 Years
Location: Santa Clara, New Mexico
We are looking for a Senior Cloud Infrastructure Engineer to implement and manage cloud-based infrastructure. The ideal candidate will stay up to date with the latest trends and advancements in Python, APIs, web server administration, application server administration, and Machine Learning technologies.
Responsibilities
- Proven experience with Cloud Kubernetes/OpenShift.
- Extensive experience with AWS services (EC2, S3, RDS, etc.).
- Experience with Machine Learning infrastructure management and Databricks platform administration.
- Experience with Python, APIs, and web server/application server administration.
- Solid understanding of storage management, both on-premises and cloud storage technologies.
- Strong expertise in CI/CD implementation, architecture design, containerization, and Docker.
- Ability to monitor and troubleshoot system and application issues effectively.
- Knowledge of security best practices for protecting APIs, servers, and cloud infrastructure.
- Install and configure the latest versions of Python, manage Python upgrades, and maintain Python environments.
- Understand on-premises (NAS) and cloud storage technologies (FSx, SnapMirror, S3).
- Administer and maintain Python-based APIs, web servers, and application servers.
- Ensure optimal performance and availability of APIs and servers.
- Monitor and troubleshoot system issues including performance bottlenecks, server crashes, and connectivity problems.
- Manage and maintain Machine Learning infrastructure, including model deployment and monitoring, data pipelines, and other ML components.
- Administer and maintain the Databricks platform including cluster management, user access, and security configuration.
- Monitor and troubleshoot Databricks clusters, job scheduling, data pipelines, and data processing workflows.
- Deploy and manage applications on AWS using services like EC2, S3, and RDS.
- Ensure scalability, security, and optimal performance of the Kubernetes infrastructure.
- Implement Continuous Integration and Continuous Deployment (CI/CD) pipelines for application releases.
- Implement and configure monitoring tools and frameworks such as Prometheus, Grafana, or similar.
- Set up monitoring dashboards to track the health, performance, and availability of applications and infrastructure.
Qualifications
- Bachelor's degree or equivalent combination of education and experience.