March 5, 2024

Tugu Forever

Computer Will Live On Forever

Big data infrastructure internship | Adaltas

4 min read
Big data infrastructure internship | Adaltas

Work description

Big Data and distributed computing are at the core of Adaltas. We accompagny our partners in the deployment, maintenance, and optimization of some of the biggest clusters in France. Due to the fact just lately we also provide help for day-day operations.

As a fantastic defender and lively contributor of open source, we are at the forefront of the facts platform initiative TDP (TOSIT Facts Platform).

For the duration of this internship, you will lead to the improvement of TDP, its industrialization, and the integration of new open resource components and new functionalities. You will be accompanied by the Alliage skilled workforce in demand of TDP editor support.

You will also do the job with the Kubernetes ecosystem and the automation of datalab deployments Onyxia, which we want to make out there to our buyers as perfectly as to college students as portion of our instructing modules (devops, big data, etcetera.).

Your qualifications will assistance to grow the services of Alliage’s open supply guidance featuring. Supported open up resource parts contain TDP, Onyxia, ScyllaDB, … For people who would like to do some world wide web do the job in addition to significant information, we presently have a extremely practical intranet (ticket management, time administration, advanced lookup, mentions and relevant posts, …) but other pleasant characteristics are anticipated.

You will follow GitOps launch chains and write posts.

You will perform in a group with senior advisors as mentor.

Organization presentation

Adaltas is a consulting company led by a team of open up supply professionals concentrating on data administration. We deploy and work the storage and computing infrastructures in collaboration with our shoppers.

Associate with Cloudera and Databricks, we are also open up resource contributors. We invite you to browse our web page and our lots of complex publications to learn far more about the firm.

Techniques necessary and to be obtained

Automating the deployment of the Onyxia datalab necessitates information of Kubernetes and Cloud native. You must be comfortable with the Kubernetes ecosystem, the Hadoop ecosystem, and the distributed computing product. You will master how the basic components (HDFS, YARN, object storage, Kerberos, OAuth, and many others.) function with each other to fulfill the takes advantage of of major knowledge.

A very good know-how of employing Linux and the command line is demanded.

Throughout the internship, you will master:

  • The Kubernetes/Hadoop ecosystem in order to lead to the TDP undertaking
  • Securing clusters with Kerberos and SSL/TLS certificates
  • High availability (HA) of services
  • The distribution of sources and workloads
  • Supervision of expert services and hosted applications
  • Fault tolerant Hadoop cluster with recoverability of dropped knowledge on infrastructure failure
  • Infrastructure as Code (IaC) by means of DevOps applications these types of as Ansible and [Vagrant](/en/tag/hashicorp- vagrant/)
  • Be relaxed with the architecture and procedure of a info lakehouse
  • Code collaboration with Git, Gitlab and Github


  • Develop into familiar with the architecture and configuration approaches of the TDP distribution
  • Deploy and examination secure and highly offered TDP clusters
  • Add to the TDP expertise base with troubleshooting guides, FAQs and article content
  • Actively contribute suggestions and code to make iterative advancements to the TDP ecosystem
  • Investigation and examine the differences among the primary Hadoop distributions
  • Update Adaltas Cloud making use of Nikita
  • Contribute to the development of a resource to collect buyer logs and metrics on TDP and ScyllaDB
  • Actively add suggestions to develop our aid option

Further facts

  • Place: Boulogne Billancourt, France
  • Languages: French or English
  • Starting day: March 2023
  • Duration: 6 months

Significantly of the digital globe operates on Open up Source application and the Massive Details sector is booming. This internship is an possibility to get precious experience in both domains. TDP is now the only really Open up Resource Hadoop distribution. This is a great momentum. As part of the TDP crew, you will have the possibility to master a single of the core significant info processing designs and take part in the enhancement and the future roadmap of TDP. We think that this is an fascinating opportunity and that on completion of the internship, you will be prepared for a prosperous occupation in Big Data.

Devices out there

A notebook with the next characteristics:

  • 32GB RAM
  • 1TB SSD
  • 8c/16t CPU

A cluster designed up of:

  • 3x 28c/56t Intel Xeon Scalable Gold 6132
  • 3x 192TB RAM DDR4 ECC 2666MHz
  • 3x 14 SSD 480GB SATA Intel S4500 6Gbps

A Kubernetes cluster and a Hadoop cluster.


  • Income 1200 € / month
  • Restaurant tickets
  • Transportation go
  • Participation in a person international meeting

In the earlier, the conferences which we attended incorporate the KubeCon structured by the CNCF basis, the Open Source Summit from the Linux Foundation and the Fosdem.

For any ask for for extra data and to submit your application, please speak to David Worms: