Mô Tả Công Việc
About Us:
We, ABC Studio (AI Bigdata Content Studio), are a Korea–Vietnam AI company specializing in AI optimization, generative AI, and big data engineering — with a focus on market intelligence, visual content, and Edge AI for smart devices.
Our visions are:
To become the global best market data company, having global e-commerce, KOLs, and SNS bigdata
To become the leading innovative AI contents engineering company at movie (VFX) and webtoon and social marketing.
To develop a solid AI infrastructure layer for efficient deployment on cloud, mobile, and embedded devices
We are waiting for enthusiastic and talented interns who are willing to accompany our long and meaningful journey together.
Job Description:
Perform web data mining, big data extraction from a variety of online sources.
Clean, transform, and validate data for use in analytics and machine learning applications.
Design and manage data warehouses, data lakes, and cloud-based storage solutions.
Automate data pipelines and workflows using Python, PySpark, and tools like Apache Airflow.
Trách nhiệm công việc
Big Data Mining: Extract and mine large-scale datasets from major e-commerce platforms in Vietnam, China, Korea, Southeast Asia,…
Data Processing: Clean, transform raw data into structured formats suitable for analytics and machine learning.
Data Infrastructure: Build automated pipelines and cloud solutions. (e.g., AWS, GCP,…).
Data Integration and Management:** Develop data warehouses and data lakes for optimal data storage and retrieval.
LLM Data Pipeline: Develop pipelines for Large Language Models (LLM), including RAG , LangChain, or LangGraph.
Visualization: Create visualizations and reports to communicate insights effectively.
Kỹ năng & Chuyên môn
Education: Final year student or fresh graduate in Computer Science, Data Science, Information Technology, or related fields
Technicail Skills:
Proficient in Python, with experience using Pandas, PySpark, or similar libraries
Experience with web scraping tools (e.g., BeautifulSoup, Scrapy, Selenium)
Understanding of data architecture: warehouses, lakes, and cloud storage
Familiarity with ETL/ELT tools (e.g., Apache Airflow) and SQL
Basic knowledge of web structures (HTML/CSS/JS) is a plus
Soft Skills: Strong problem-solving skills, attention to detail, and a passion for data engineering.
Communication skills: good communication skills in Vietnamese and English
Phúc lợi dành cho bạnWhat You Will Learn
Web data mining and handling large-scale real-world datasets
Building automated data pipelines with Python, PySpark, and Airflow
Xem toàn bộ Mô Tả Công Việc
Hình thức
Full-time
Mức lương
Thỏa thuận
Báo cáo tin tuyển dụng: Nếu bạn thấy rằng tin tuyển dụng này không đúng hoặc có dấu hiệu lừa đảo,
hãy phản ánh với chúng tôi.
Tham khảo: 10 Dấu hiệu nhận biết hành vi lừa đảo qua tin tuyển dụng.
Tham khảo: 10 Dấu hiệu nhận biết hành vi lừa đảo qua tin tuyển dụng.