Data Science & Engineering Intern
Job Title: Data Science & Engineering Intern
Job Context:
CliqPack Limited is a multinational IT company at the forefront of digital transformation, delivering innovative software solutions and digital platforms to clients across the globe. We are looking for a motivated, analytical, and forward-thinking Data Science & Engineering Intern to join our growing data team.
This role is pivotal in supporting the design, development, and maintenance of data infrastructure that powers our SaaS platforms—such as CliqProperty, POS, ERP, CRM—and custom enterprise applications. The selected intern will gain real-world exposure in data pipeline development, ETL processes, big data platforms, and AI/ML-powered analytics, while working alongside an experienced engineering team in a modern agile environment.
Duties & Responsibilities:
Data Collection, Pipeline & ETL Support
- Assist in collecting and preprocessing data from websites and APIs using Selenium, BeautifulSoup, Pandas, and Python.
- Design and implement database schemas and develop SQL queries to support ETL workflows.
- Collaborate with senior engineers to optimize data pipelines for performance, scalability, and reliability.
- Automate routine data tasks while ensuring data accuracy, consistency, and integrity.
Data Integration, Quality & Analytics
- Integrate data from APIs, third-party services, and internal platforms.
- Monitor and troubleshoot pipeline issues to ensure data reliability and completeness.
- Perform data validation, quality checks, and reporting.
- Support feature engineering and preprocessing for machine learning workflows
AI, Machine Learning & Automation
- Assist in building and evaluating machine learning models using Scikit-learn, TensorFlow, PyTorch, and related frameworks.
- Contribute to ML model training, experimentation, and evaluation.
- Implement automation workflows for routine data tasks using Python/SQL and ML libraries.
Ethical Automation & Anti-Bot Handling (Priority)
- Knowledge/Experience with ethical CAPTCHA and anti-bot handling, including human-in-the-loop flows, authenticated scraping, session management, proxy rotation, and rate-limiting, is considered a plus.
Collaboration & Professional Development
- Collaborate with engineers, analysts, and business stakeholders to deliver actionable data solutions.
- Participate in agile ceremonies, knowledge-sharing sessions, and team discussions.
- Continuously develop expertise in modern data engineering practices, big data frameworks (Spark/Hadoop), cloud services (AWS, GCP, Azure), and AI/ML integration methods.
Educational Qualifications:
Experience Requirements:
- Freshers are encouraged to apply.
- Prior internship, academic projects, or coursework in data engineering, ML pipelines, database management, or AI applications is a plus.
Additional Requirement:
- Age: 22–26 years.
- Proficiency in Python, SQL, or Java for data-related tasks
- Familiarity with ETL processes, data warehousing, and database design.
- Exposure to ML/AI concepts: supervised/unsupervised learning, model training, and data preprocessing.
- Knowledge of data visualization tools: Matplotlib, Seaborn, Plotly.
- Knowledge of MS Excel/Google Sheets for quick data handling.
- Interest in data scraping, mining, ML/AI model deployment, and pipeline automation.
- Strong analytical, problem-solving, and communication skills in English
- Team player in collaborative, fast-paced environments.
Compensation & Other Benefits:
- Two weekly holidays: Friday & Saturday
- Annual Salary Review (upon confirmation)
- Opportunity to work on international-grade SaaS and AI-driven products
- Full-time employment opportunity upon successful completion of 6-month internship