
Have horrible Excel files? We have considerable practical experience automatically processing thousands of legacy Excel files per day in big pharma production environments (CRO data in sponsored clinical studies, in vivo CRO results, raw lab instrument data).
Parse the files with an automated multi-step reader, then import the data into a database or collate it for on-disk or cloud bucket storage. Data validation is key in this process. We have learned from bitter experiences and will help you with data validation steps to make the capture or migration as painless and efficient as possible.
Have a legacy database that is being replaced by another? We can help you migrate the data. Well-written migration scripts ensure that every step in the process is consistent and reproducible. What might have been an art becomes an exact science.
Automation is Key to Reproducibility
Automation of every data cleanup step ensures consistency and reproducibility in how the data are processed, analyzed, reported to end users, and archived. In regulated environments it also helps create a reliable chain of custody for the data.
Machine Learning (ML) and artificial intelligence (AI)
Are you starting a deep learning/AI project? Do you have the data? We will help you avoid data preparation mistakes at the project start. A consistent, rule-based, well-documented cascade of steps applied to raw data to filter and balance it, will make all the difference when modeling begins.
We bring to the table extensive practical experience in the field as an active practitioner. We honor strict client confidentiality and bring an industry insider perspective to your team and your project.