Mô Tả Công Việc
*According to Decree No.13/2023/ND-CP on protecting personal data (“PDP”), Home Credit Vietnam would apply "Personal Data Processing Agreement" with all candidates to ensure compliance with the decree.By submitting this application to Home Credit Vietnam Finance Company Limited through ITviec, you agree to allow Home Credit to proceed your provided information in accordance with Personal Data Processing Agreement that you have read, fully understood and agreed to the entire content at link KEY RESPONSIBILITIES ML Data Platform Engineering & AdministrationInstall, configure, and maintain ML data platforms on top of Kubernetes, Object Storage, Cassandra, Postgres and related technologiesMonitor platform performance and optimize as needed for reliability and efficiencyPlatform Configuration and MaintenanceImplement and manage platform configurations, ensuring adherence to best practices and security standardsRegularly update and patch systems to maintain security and stabilityCollaborate with Cross-Functional TeamsWork closely with ML and data engineers and other roles to align IT needs and strategiesProvide expert guidance on ML data platform best practices and optimizationTroubleshoot and Resolve Technical IssuesIdentify, diagnose, and resolve data platform problems in a timely mannerEscalate complex issues to upper-level support when necessaryBackup and Recovery ManagementImplement and maintain backup and recovery strategies for data platforms, ensuring data integrity and availabilityMaintain and Update DocumentationCreate and maintain documentation related to data platform administration, configuration, and maintenanceShare knowledge with team members and contribute to a culture of continuous learningEnhance Data Security and ComplianceEnsure data platforms adhere to security best practices and comply with relevant regulationsStay up-to-date on industry trends and evolving security standardsDrive Continuous ImprovementEvaluate and implement new technologies and techniques to enhance data platform performance and administrationProactively identify areas for improvement, prepare plan for implementation and get support from management and development teamsAlways prefer automation and code-first approach over hard to reproduce manual tasks
Xem toàn bộ Mô Tả Công Việc
Yêu Cầu Công Việc
A degree in computer science, software engineering, information technology or related fields is preferredProduction use experience with AI agents (like Langchain, Agno) and LLM stack (like KServe, vLLM, pgvector)At least 3 years of experience in data platform (Kafka, Spark, Airflow, Flink, KServe, MLFlow, Lakekeeper) and related infrastructure installation, administration, patching and automation (Linux, Ansible, Helm, Terraform, Kubernetes, Ceph)Continuous improvement mindset covering daily operations, stability, reliability and performance of ML data platformsKnowledge of key concepts like infrastructure-as-code, templates, playbooks, code versioning using GIT, CI/CD automation, high availability, disaster and recoveryProficient in Linux Shell, Python scripting, configuration using JSON, YAML filesAbility to troubleshoot infrastructure, analyze logs and setup/update monitoring dashboards and metrics, describe and document root cause, attend retrospective meetingsUnderstand IT systems documentation, data flow diagrams, integration diagrams and related terminology (UML)Understand major ML data platforms concepts – LLM, RAG, agent, quantization, fine-tuning, Data Lake, ACID, Distributed Query, Feature Store, Data Governance, Data Catalogue, Streaming, Batch , ParquetProficient in using documentation (Markdown, Visio, Office 365) and communication tools (MS Teams)Proficient user of change management and support ticket/service desk tools (JIRA SD)
Xem toàn bộ Yêu Cầu Công Việc
Quyền Lợi
13th Salary Fixed and KPI BonusPremium Health Care program24/7 Accidental Insurance100% Social InsuranceMeal + Phone AllowanceYearly Medical CheckupProfessional and Transparent Working EnvironmentApply Latest Financial Technology in the World