Multi-cloud infrastructure, cloud architecture, multi-cloud global network, infra and compute platform, Azure,
Amazon AWS, infrastructure as code, continuous integration, continuous deployment, cross-functional team
of teams building and execution
Multi-cloud infrastructure, cloud architecture, multi-cloud identity management, Azure, Amazon AWS, Active
Directory, IAM, Cloud Shell. PowerShell, Python. LDAP, VPC, VPN (IPSEC tunnels)
Evolution of the legacy implementation to the new generation IT infrastructure using container orchestration on
cloud. Cloud end to end containerization, provisioning and deployment architecture.
Evolution of the back-end processing to new generation cloud services.
More details not possible due to NDA
Full automation of Linux installation for multiple deployment profiles targeting very large DB systems
solution using Dockers-Openshift-Kubernetes
Evolution of the legacy implementation to the new generation IT infrastructure using Red Hat on premise
cluster platform. Focus is on security and modern modular IT approach using micro-services, Docker and
application container orchestration via Kubernetes to provide IT services on demand
More details not possible due to NDA
Data integration with iRODS object store
• R Studio, SQL, R, C++, PostgreSQL, RPC, Git, VSTS, Visual Studio 2015, CMake, SSL
• R/C++ binding library to allow data access to iRODS from R programming session
• ********
Data integration between Clarity and other various LIMS systems
• JBOSS, Apache Tomcat, REST API, PostgreSQL, Pentaho, XML messaging, SSL
• Automatic biological sample tracking synchronization between Clarity and LIMS systems
• The workflow will ensure the bi-directional XML/REST messaging communication between the
systems
• The updates in both systems are going through SSL secure connection with clean transactional
semantics to make sure the updates are auditable and safe to guarantee data consistency
Data and API integration with the R-Studio statistical development environment
• R, R Studio, PostgreSQL, BI, SQL, OLAP, multi-dimensional data sets, data-frames
• R package developed to provide access to clean analytic data sets ready for analysis from R
programming session
• The package provides user friendly functions to slice-n-dice data sets with OLAP on server side if
possible
Data integration for REST APIs in IP search domain
• Galaxy, REST, Python
• POC helping to fetch data from the selected public databases into Galaxy workflow engine based on
the given data signatures
• The data is used for predictive Intellectual Property analysis related to patent IP knowledge research
High throughput screening data integration (Molecular Devices/Leica/Olympus/MDC
Store/Vendor APIs)
• C++, Java, Maven, Visual Studio 2013/2015, OpenCV, Bioformats, Python, Flask, JSON, XML, data
processing pipelines, iRODS, meta-data, high-definition image processing and analysis
• The system for extract-transform-load of microscopy HD images to be transferred into object broker
data storage engine and used from there for image processing, analysis and machine learning
• The data processing pipelines were connected to the iRODS object store to search and load the
selected image data sets and perform the image processing and analysis jobs
New generation data-warehouse for multi-OMICS analytics with output to multi-dimensional data
structures
• BI, OLAP, Kimball, ETL, SQL, MySQL, PostgreSQL, Pentaho, CI, CD
• Multi-study and clinical trials secured data platform connected to integrated data-warehouse
• Technology stack evaluation and proposal for the project
• Design of the all DWH zones
• Design of the continuous data integration workflow
• Design of the data distribution model for multi-dimensional data sets
• Implementation of the ETL workflows
• Development of the data ingestion and data distribution ETL jobs
• Design and architecture of the data access layer for the data scientists in the distribution zone
High throughput data processing platform - full stack (Python, Mongo, NODE.JS)
• Full-stack: NODE.JS, JavaScript, custom frameworks, JSON, Mongo DB
• Back-end: Python, C, C++, Perl, Java, Flask, Condor, LFS, Docker, REST, SSL, R, JSON
• Design of the LIMS/MongoDB/Back-End ETL synchronization workflow
• Design of the LIMS/Mongo/iRODS/Back-End ETL workflow
• Design of the LIMS/Mongo.iRODS/Microscopy ETL workflow
• Development of the Microscopy image processing pipeline framework
• Development of the iRODS data integration layer
More details not possible due to NDA
Total automation system and data integration for continuous integration using REST and SOAP APIs for JIRA, Perforce
and QuickBuild
Improved Continuous integration 24/7, build system, test and diagnostics automation
Improved multi-platform code quality assurance process on Android, Linux, Windows, iOS
Improved the Android performance tuning, debugging, root cause analysis process
More details not possible due to NDA
HTML5 live & VOD cloud video streaming platform on Linux Ubuntu/CentOS VMs
Implemented support for WEBM video container
Improved buffering scheme and multi-threading of the streaming relay server More details not
possible due to NDA
Technology & design advisory for the cloud processing system of histopathology data at early stage
More details not possible due to NDA