Data Engineer
Job Description
Must Have Technical/Functional Skills
- Minimum 7 years of overall experience in data engineering roles.
- Minimum 3 years of hands-on experience with Flink, Iceberg, and Starburst.
- Extensive hands-on experience (3+ years) with Apache Flink for real-time stream processing,
including Flink SQL and Flink DataStream API.
- Proven experience with Apache Iceberg, specifically in designing and managing data lakehouse
architectures, schema evolution, and performance optimization of Iceberg tables.
- Solid experience with Starburst (Trino) or open-source Trino for data virtualization, federated querying,
and connecting to various data sources.
- Proficiency in at least one programming language commonly used in data engineering (e.g., Python, Scala, Java).
- Strong experience with cloud platforms (AWS, Azure, or GCP) and their respective data services
(e.g., S3, ADLS, EMR, Databricks, Snowflake, BigQuery, Redshift).
- Deep understanding of distributed systems, data warehousing concepts, ETL/ELT methodologies,
and data modeling.
- Experience with containerization (Docker) and orchestration (Kubernetes).
- Familiarity with messaging systems like Apache Kafka.
- Excellent problem-solving, analytical, and communication skills, with the ability to articulate complex
technical concepts.
Roles & Responsibilities
- Real-time Data Pipeline Development: Design, develop, and maintain robust, high-throughput real-time data
streaming and processing pipelines using Apache Flink for complex event processing, stream analytics, and continuous transformations.
- Data Lakehouse Architecture: Implement and manage data lakehouse solutions leveraging Apache Iceberg for table
format management, ensuring ACID transactions, schema evolution, and efficient data versioning on large-scale data lakes.
- Data Virtualization & Federated Query: Utilize Starburst Enterprise (Trino) for data virtualization and federated
querying across diverse data sources, optimizing query performance and enabling unified data access for analytics and
reporting.
- ETL/ELT Development: Develop, optimize, and maintain traditional and modern ETL/ELT processes using various
tools and programming languages (e.g., Python, Scala, Java) to ingest, transform, and load data into analytical systems.
- Cloud Data Platform Integration: Integrate data solutions with Client's cloud infrastructure (e.g., AWS, Azure, GCP)
and leverage native cloud data services for storage (e.g., S3, ADLS), compute, and analytics.
- Performance Tuning & Optimization: Identify and resolve performance bottlenecks within data pipelines, Flink jobs,
Iceberg tables, and Starburst queries, ensuring optimal resource utilization and query response times.
- Data Quality & Governance: Implement data quality checks, monitoring, and alerting mechanisms within data pipelines.
Ensure adherence to data governance policies, metadata management, and data lineage standards.
- Architectural Input: Contribute to the architectural design and evolution of Client's data platform, providing expertise
on Flink, Iceberg, and Starburst capabilities and best practices.
- Collaboration & Mentorship: Work closely with data architects, data scientists, business analysts, and other
engineering teams. Mentor junior engineers and foster a culture of technical excellence and continuous learning.
- Operational Excellence: Establish monitoring, logging, and alerting for data pipelines and infrastructure. Participate
in on-call rotations as needed to ensure the reliability and availability of data systems.
- Documentation: Create and maintain comprehensive technical documentation for data pipelines, architecture,
and operational procedures.
Salary Range: $115,000 to $140,000 per year
**Location**
Irving, TX
**Job Function**
TECHNOLOGY
**Role**
Engineer
**Job Id**
400533
**Desired Skills**
PL/SQL
**Salary Range**
$115,000-$140,000 a year
Desired Candidate Profile
**Qualifications**: Bachelor of Computer Science
Verified Visa Sponsor
More from Tata Consultancy Services (TCS)
