Mô Tả Công Việc
About the RoleWe are looking for a Data Engineer to build and operate the backbone of our robotics data infrastructure. In this role, you will design and maintain scalable data pipelines that collect, process, and store large volumes of multimodal data generated from robots at the edge.You will work closely with cross-functional teams including Vision, Conversation AI, and Robotics Engineering to ensure high-quality data flows into centralized systems for training, analysis, and intelligent querying. Key ResponsibilitiesBuild and Maintain Data PipelinesDesign and implement end-to-end data pipelines that ingest processed data from edge devices (robots) and deliver it to centralized storage and processing systems.Ensure reliable, scalable, and efficient data flow across different layers of the system architecture.Manage Knowledge DatabasesDeploy and optimize vector databases and graph databases to manage metadata and vectorized multimodal data (audio, text, video).Enable efficient and intelligent data retrieval for downstream AI systems.Ensure Data QualityCollaborate with internal teams (e.g., Conversation AI, Vision) to implement high-quality data filtering and distillation pipelines.Support the development of robust processes for large-scale data processing and refinement.Security and MonitoringImplement access control, monitoring, and alerting systems to ensure secure and stable data operations across multiple sites.Monitor pipeline health and system performance to maintain reliability.
Xem toàn bộ Mô Tả Công Việc
Yêu Cầu Công Việc
Technical RequirementsProgrammingStrong proficiency in Python and SQL for building and maintaining automated data pipelines.Cloud InfrastructureHands-on experience with AWS, particularly EC2 and S3, including compute and storage resource management.DatabasesExperience with Vector Databases such as Qdrant, Pinecone, or similar technologies.Familiarity with handling multimodal data (e.g., video, LiDAR, robot state data).Experience with mCAP or similar robotics data formats is a plus.Systems & InfrastructureSolid understanding of distributed systems, large-scale data processing, and data synchronization mechanisms. Expected OutcomesBuild a complete data pipeline infrastructure connecting cloud databases, processing servers, and local storage.Enable:Large-scale data movement with fast retrievalEfficient data querying and visualizationSuccessfully implement the data distillation pipeline to produce high-quality datasets for downstream AI systems.
Xem toàn bộ Yêu Cầu Công Việc
Hình thức
Full-time
Quyền Lợi
Competitive CompensationWorld-Class Team in Humanoid RoboticsCutting-Edge Humanoid Robot Products
Mức lương
Thỏa thuận
Báo cáo tin tuyển dụng: Nếu bạn thấy rằng tin tuyển dụng này không đúng hoặc có dấu hiệu lừa đảo,
hãy phản ánh với chúng tôi.
Tham khảo: 10 Dấu hiệu nhận biết hành vi lừa đảo qua tin tuyển dụng.
Tham khảo: 10 Dấu hiệu nhận biết hành vi lừa đảo qua tin tuyển dụng.