$40000 - $45000 Year
Monday to Friday
Day shift
Health insurance
Paid time off
Employee discount
Dental insurance
Vision insurance
401(k)
401(k) matching
Flexible schedule
Parental Leave
Tuition reimbursement
Flexible spending account
Retirement plan
Others
Key Responsibilities:
Infrastructure Design and Development:
Design and architect a scalable and secure on-premise hosting environment for large language models.
Develop and implement infrastructure automation tools for efficient management and deployment.
Ensure high availability and disaster recovery capabilities.
Performance Optimization:
Optimize the hosting environment for maximum performance and efficiency.
Implement monitoring tools to track system performance and resource utilization.
Regularly update the infrastructure to incorporate the latest technological advancements.
Security and Compliance:
Establish robust security protocols to protect sensitive data and model integrity.
Ensure compliance with data protection regulations and industry standards.
Conduct regular security audits and vulnerability assessments.
Collaboration and Support:
Work closely with AI/ML teams to understand their requirements and provide suitable infrastructure solutions.
Provide technical guidance and support to internal teams and stakeholders.
Stay abreast of emerging trends in AI infrastructure and large language model hosting.
Resource Management:
Manage physical and virtual resources to ensure optimal allocation and utilization.
Forecast resource needs and plan for future expansion and upgrades.
REQUIRED SKILLS
Qualifications:
Bachelor's or Master's degree in Computer Science, Information Technology, or a related field.
Proven experience with overall experience of 8+yrs working in infrastructure architecture, preferably with exposure to AI/ML environments.
Strong knowledge of networking, storage, and computing technologies.
Experience with virtualization technologies, cluster management tools and container orchestration tools (e.g., Kubernetes).
Familiarity with open-source large language models and their hosting requirements.
Excellent problem-solving and analytical skills.
Strong communication and collaboration abilities.
Preferred Skills:
Certifications in cloud architecture or systems engineering.
Experience with high-performance computing (HPC) environments.
Knowledge of data privacy laws and compliance requirements.
Communication
Leadership
Teamwork
Interpersonal
Learning/adaptability
Self-management
Organizational
Computer
Problem solving
Open mindedness
Strong work ethic
technology
Others
No experience needed
Others
Technology
On going position
Hybrid Remote