Mô Tả Công Việc
Position OverviewWe’re looking for a talented Data Engineer with strong AWS expertise to design, build, and maintain the data infrastructure that powers our vehicle inspection platform. At Pave.ai, you’ll be responsible for developing scalable and reliable data pipelines that process millions of vehicle inspections, images, and automotive data points — delivering real-time insights to customers across the automotive ecosystem.In this role, you will collaborate closely with our engineering and data science teams based in both Canada and Vietnam, working together to design end-to-end solutions that support advanced analytics, machine learning models, and business intelligence tools. You’ll play a key role in ensuring data accuracy, scalability, and system performance. Key ResponsibilitiesData Pipeline DevelopmentDesign and implement scalable ETL/ELT pipelines for processing vehicle inspection data, images, and metadataBuild real-time data processing workflows for instant inspection results and damage detectionCreate data ingestion solutions from mobile apps, APIs, IoT devices, and third-party automotive systemsImplement data quality frameworks to ensure inspection accuracy and complianceOptimize pipelines for processing high-volume image data and computer vision outputsAWS Data Platform ManagementArchitect data warehousing solutions using Amazon Redshift for vehicle inspection analyticsDesign schemas optimized for automotive data (VIN, inspection history, damage reports, pricing)Implement data lakes using S3 for storing inspection images, videos, and unstructured dataManage inspection metadata and vehicle catalogs using AWS Glue Data CatalogBuild ML-ready datasets for computer vision and damage detection modelsAnalytics & VisualizationDevelop QuickSight dashboards for vehicle inspection metrics, damage trends, and pricing analyticsCreate self-service analytics for dealerships, insurers, and fleet operatorsBuild real-time inspection monitoring dashboards for quality assuranceImplement predictive analytics for vehicle valuation and damage assessmentDesign automated reports for inspection volumes, accuracy rates, and customer KPIsData Integration & OrchestrationIntegrate with automotive data providers (Carfax, KBB, automotive APIs)Build real-time processing for mobile inspection data using KinesisImplement workflows connecting inspection data with customer CRMs and dealer management systemsDesign event-driven architectures for inspection status updates and notificationsCreate APIs for inspection data access by partners and third-party platformsInfrastructure & OperationsImplement Infrastructure as Code using CloudFormation or TerraformSet up monitoring and alerting using CloudWatch and SNSEnsure data security through encryption, VPC configuration, and IAM policiesOptimize AWS costs through resource management and Reserved InstancesMaintain data recovery and backup strategiesOwn operational reliability of the data platform, including versioned pipelines, CI/CD integration for test data provisioning, and improvements in data quality and governance to prevent application failures from raw vs. processed data mismatches
Xem toàn bộ Mô Tả Công Việc
Yêu Cầu Công Việc
Experience4+ years of experience as a Data Engineer or similar role3+ years of hands-on experience with AWS data servicesExperience with image/video data processing and storage at scaleBackground in automotive, insurance, or inspection technology is a plusProven track record of building production data pipelines for high-volume consumer applicationsTechnical SkillsAWS Services Expertise:Amazon Redshift: Cluster management, performance tuning, SpectrumAmazon QuickSight: Dashboard development, SPICE, ML insightsAWS Glue: ETL jobs, crawlers, data catalogAmazon S3: Data lake architecture, lifecycle policies, partitioningAmazon Athena: Query optimization, partition projectionAmazon Kinesis: Real-time data streaming and analyticsAWS Lambda: Serverless data processingAmazon EMR: Big data processing with Spark/HadoopProgramming & Tools:Strong programming skills in Python and SQLExperience with PySpark or Spark SQLProficiency with Git and CI/CD pipelinesKnowledge of data orchestration tools (Airflow, Step Functions)Familiarity with dbt (data build tool) for data transformationData Engineering Concepts:Strong understanding of data warehousing and data lake architecturesExperience with both batch and stream processing paradigmsKnowledge of data modeling techniques (star schema, data vault)Understanding of data governance and lineageCore CompetenciesStrong analytical and problem-solving skillsExcellent communication skills for working with technical and business stakeholdersSelf-motivated with ability to work independentlyDetail-oriented approach to data qualityPassion for automation and optimizationPreferred QualificationsAWS Certifications (Solutions Architect, Data Analytics Specialty)Experience with computer vision data pipelines and ML model deploymentKnowledge of automotive industry data standards (VIN decoding, OBD-II)Experience with geospatial data and location-based analyticsFamiliarity with image optimization and CDN strategiesUnderstanding of data privacy regulations (GDPR, CCPA) for consumer dataExperience with mobile app analytics and real-time data synchronizationBackground in building multi-tenant SaaS data architectures
Xem toàn bộ Yêu Cầu Công Việc
Hình thức
Full-time
Quyền Lợi
Competitive salaryFlexible work arrangements, including hybrid options13th-month bonus in accordance with company policyComprehensive health, dental, and vision insurance for the employee and one dependentProfessional development budgetOpportunity to shape the future of AI technologyCollaborative and innovative work environment
Mức lương
Thỏa thuận
Báo cáo tin tuyển dụng: Nếu bạn thấy rằng tin tuyển dụng này không đúng hoặc có dấu hiệu lừa đảo,
hãy phản ánh với chúng tôi.
Tham khảo: 10 Dấu hiệu nhận biết hành vi lừa đảo qua tin tuyển dụng.
Tham khảo: 10 Dấu hiệu nhận biết hành vi lừa đảo qua tin tuyển dụng.