BIG DATA & CLOUD EXPERT
Société Générale -CIB | Paris
11/2020 - 12/2023
Corporate & investment banking
Projects:
&bull Automate deployment of infrastructure and containerized applicaons on the
private and public cloud, using CI/CD pipelines.
&bull Projects migraon from HDP to Cloudera clusters
&bull Perform secure migraon to the cloud by designing and deploying landing
zones
Tasks:
&bull Design and automate Cloudera clusters infrastructure and components
deployment
&bull Automate Cloudera administraon tasks and configuraon management
&bull Automate data migraon from HDP to Cloudera clusters
&bull Provide support to development and business teams during and aer
migraon to Cloudera Clusters
&bull Automate/integrate infrastructure and soluon deployment on private cloud
&bull Design and configure Azure landing zones with respect to key design
principles
&bull Design and deploy containerized compung soluon on Azure and AWS
Kubernetes services, using ArgoCD/Argo Workflows.
&bull Realise PoC on Azure and AWS Kubernetes services.
&bull Define architecture of new features including observability on Azure and AWS
Kubernetes
&bull Design and implement GitOps pipelines for configuraon management
&bull Troubleshoot CI/CD pipelines and migraon/deployment related problems
Tools:
Hadoop CDP, Spark, Git/Github, Ansible, Terraform, Jenkins, ArgoCD/Argo
Workflows, Linux RedHat, Azure/AWS, Kubernetes, Helm, prometheus,
Grafana, Elasc ELK, Airflow
BIG DATA ARCHITECT
Cyllene Group | Nanterre
5/2023 - 7/2024
Digital transformaon operator
Project: Migrate Hadoop cluster to Cloudera
Tasks:
&bull Plan and execute HDP Hadoop cluster migraon to CDP
&bull Plan data and applicaons migraon
&bull Troubleshot post migraon problems
&bull Advice on backup and monitoring soluons
Tools:
HDP, CDP, Spark, Pormetheus/Grafana
BIG DATA EXPERT
Dassault Systèmes | Paris
7/2021 - 7/2021
Project: Design and deploy a soluon for data classificaon
Tasks:
&bull Design architecture for Data classificaon and security soluon
&bull Implement/integrate the designed soluon using CI/CD pipelines
&bull Advice on compung soluon and data storage design
Tools:
Git, Ansible, Jenkins, Linux Redhat, Docker, Kubernetes, Spark, Atlas, Ranger
Crédit Agricole (Retail banking)
11/2021 -
Architecture documentation and tech integration steering of 2 first instances of new CDP CA group offer
Integration and architecture documentation of multi-tenant Dataiku new CDP CA group offer for retail banking data science (POV)
_CDP, HDP&HDF, Dataiku, Rstudio, Teradata&hellip
Société Générale
9/2019 -
Member of BigData architecture authority team, in charge of designing new gen BigData platform
Architecture design, specification & POC implementation of a multi-sources low-latency data access solution end to end
Architecture design & specifications of a stream+batch data ingestion as a service to multiple hybrid BigData platforms
_Presto, Pulsar, Kafka, HDP, CDP, Hudi, CarbonData, Prometheus &hellip
ATOS for Directorate General of Armament (DGA)
2/2019 - 4/2020
Architecture design & specifications of federated data governance repository providing data lineage, classification and access restriction.
_Atlas, Egeria, Ranger, Keycloak, Cassandra, Accumulo, MongoDB, Redis,
Architecture design and implementation
BouyguesTelecom
10/2018 - 2/2019
HDP 3.1 cluster deployment (replacing Cloudera 5.7 legacy cluster) + historical data duplication from legacy cluster
NiFi cluster deployment (replacing Sqoop existing flows on legacy) and NiFi template for Sqoop replacement
Documentation & support
Architecture documentation, security solution (using Atlas + Ranger + KRB + HDFS TDE), HDP tuning/customizing & Best Practices
Support Spark dev team adapting Spark to new platform & DevOps team for integration and customisation
_Hadoop HDP & Cloudera CDP, NiFi, Kafka, ElasticSearch, Ignite, Ansible
Architecture design and implementation
Kering
3/2018 - 12/2018
Design and implement Hadoop solution on AWS: EMR + S3 (+ NiFi + Kafka + Cassandra on-prem)
Deployment fast layer OLAP in Memory on AWS: Druid + Superset + MR2 (for DRUID indexing)
Tuning performances and resilience of existing infra on-prem: Cassandra, SolR, Zookeeper, Elastic, Spark cluster standalone
Design & POC sol. replication SolR and Cassandra from on-prem DCs to 3 AWS regions (US, AP, EU)
Documentation & support
Architecture documentation & best practices for best performances and reliability
_Hadoop EMR, Druid, Superset, Spark standalone & on YARN, Hive,
Architecture design and implementation
Renault-Nissan-Mitsubishi
1/2017 - 3/2018
Build new highly multi tenant HDP 2.6 clusters (for each brand) + participation in kerberisation process
Design resilient and scalable SQL stack for HDP meta stores (MariaDB+Galera+ProxySQL)
Design, delivery and tuning new infrastructure (Linux RH7 + edges virtualisation) + Network evolution to leaf-spine arch. (Cisco ACI)
Design and delivery stack monitoring / metrology / capacity planning (Prometheus + Grafana + DrElephant&hellip)
Design and delivery stack benchmark & non-reg. tests (HiBench, YCSB, TPC-DS/TPC-H, SYSbench, custom benches)
Performances optimisation & benchmarks of most HDP stack components and a lot of projects deliveries
Usage model & Capacity Planning of HDP clusters & ElasticSearch
Documentation & support
Documentation of architecture, HDP best practices for stack administrators, developers and end users
Troubleshooting HDP and connected tech. (like HUE, R/Rstudio), ElasticSearch, Java, infra (Linux & network)
Interface with infra teams: Linux, Network and DCs
L3/4 users support (end users, data engineers, data scientists) especially regarding performances issues
AMS, Sqoop, OOZIE, Knox, Ranger, NiFi, Kafka, HUE, ElasticSearch, Rstudio srv, R, Anaconda&hellip
SFIL
11/2015 - 7/2017
Technical architecture design for business projects (DWH, BO BI, SAB AT, Moody&rsquos, GMS&hellip) and non-prod. deployment
IS infrastructure transformation to SDDC scenarii (using VMWare stack)
Support for infrastructure and production teams during major evolutions, complex issues & performances optimisation
POC and integration of new technical bricks in IS (Good Technology, Container W2016, Ignite, MongoDB&hellip)
_MongoDB, Ignite, R, WebSphere, IIS, Tomcat, Windows, VMware, Citrix, SQLserver, Oracle, AS400&hellip