In Cyclad we work with top international IT companies in order to boost their potential in delivering outstanding, cutting edge technologies that shape the world of the future. Currently, for our client, we are looking for a talented and dedicated Software or Data Engineer to join a team, focusing on the developments of capabilities to enable Data Mesh, and offering expertise in leveraging contemporary technologies. This involves advising on the utilization of services available on public clouds or developing in house services on internal Kubernetes-based platform.
Project information:
Your tasks:
Design, develop, and maintain scalable data architectures including databases, data lakes, and data warehouses. Implement best practices for data storage, retrieval, and processing. Drive the adoption of Data Mesh principles to promote decentralized data ownership and architecture.
Develop and maintain reusable data products that serve various business units. Collaborate closely with data scientists and analysts to understand data needs and design data models that meet business requirements. Develop and maintain ETL (Extract, Transform, Load) processes for moving and transforming data from various sources into the data infrastructure aligned with Data Mesh principles.
Implement and enforce data quality standards and governance policies. Develop and maintain metadata documentation, data lineage, and data dictionaries for all data assets to ensure they are discoverable and accessible across the organization.
Design and implement Kubernetes-based deployment strategies for scalable, reliable, and manageable data technologies. Collaborate with DevOps and infrastructure teams to optimize data technology deployment processes within a Kubernetes environment.
Document Data Mesh implementations, Proof of Concept (PoC) results, and best practices to share knowledge and create reference materials for future use.
Requirements:
- RDBMS (PostgreSQL/MySql etc.)
- NoSQL Storages (MongoDB, Cassandra, Neo4j etc.)
- Timeseries (InfluxDB, OpenTSDB, TimescaleDB, Prometheus etc.)
- Workflow orchestration (AirFlow/Oozie etc.)
- Data integration/Ingestion (Flume etc) .
- Messaging/Data Streaming (Kafka/RabbitMQ etc.)
- Data Processing (Spark, Flink etc.) And/Or with their Cloud provided counterparts, i.e., Cloud Data/Analytics services (GCP, Azure, AWS)
We offer:
We proudly deliver to the leaders across industries.