We ran into difficulties working out how to accomplish this project, and I also had a budget shortfall, since creating a dataset can be costly. We settled on a synthetic dataset and plan to bolt on additional parameters once the dataset is built correctly.
The data pipeline consists of ingestion, validation, transformation, training, evaluation, and a prompt front end.
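As a rough illustration of how those stages chain together, here is a minimal sketch in plain Python. It is not the project's actual code: the function names, the `x`/`y` fields, the made-up linear relationship, and the least-squares model are all assumptions chosen only to keep the example self-contained, and the prompt front end is left out.

```python
import math
import random
from typing import Dict, List, Tuple

Row = Dict[str, float]


def ingest(n: int = 200, seed: int = 0) -> List[Row]:
    """Ingestion: generate synthetic rows (hypothetical fields x and y)."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        x = rng.uniform(0.0, 10.0)
        # Made-up relationship plus noise, standing in for real measurements.
        y = 2.0 * x + 1.0 + rng.gauss(0.0, 0.1)
        rows.append({"x": x, "y": y})
    return rows


def validate(rows: List[Row]) -> List[Row]:
    """Validation: drop rows containing NaN or infinite values."""
    return [r for r in rows if all(math.isfinite(r[k]) for k in ("x", "y"))]


def transform(rows: List[Row]) -> List[Row]:
    """Transformation: standardize x to zero mean and unit variance."""
    mean = sum(r["x"] for r in rows) / len(rows)
    var = sum((r["x"] - mean) ** 2 for r in rows) / len(rows)
    std = math.sqrt(var) or 1.0  # avoid dividing by zero on constant input
    return [{"x": (r["x"] - mean) / std, "y": r["y"]} for r in rows]


def train(rows: List[Row]) -> Tuple[float, float]:
    """Training: fit y = slope * x + intercept by ordinary least squares."""
    n = len(rows)
    mean_x = sum(r["x"] for r in rows) / n
    mean_y = sum(r["y"] for r in rows) / n
    sxx = sum((r["x"] - mean_x) ** 2 for r in rows)
    sxy = sum((r["x"] - mean_x) * (r["y"] - mean_y) for r in rows)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def evaluate(model: Tuple[float, float], rows: List[Row]) -> float:
    """Evaluation: mean squared error on the given rows."""
    slope, intercept = model
    return sum((slope * r["x"] + intercept - r["y"]) ** 2 for r in rows) / len(rows)


def run_pipeline(seed: int = 0) -> Dict[str, object]:
    """Run every stage in order with an 80/20 train/test split."""
    rows = transform(validate(ingest(seed=seed)))
    split = int(0.8 * len(rows))
    model = train(rows[:split])
    return {"model": model, "mse": evaluate(model, rows[split:])}


if __name__ == "__main__":
    print(run_pipeline())
```

Keeping each stage as a separate function means the synthetic `ingest` step could later be swapped for a loader that reads real measurements without touching the rest of the chain.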
The biggest hang-ups were the experimental design of the dataset and finding a local UV/IR spectroscopy lab with availability.
This project is seeking funding to create a bigger dataset. The hard programming is complete, thanks to Partha Pratim Kalita, with modeling by Suraj Mishra.
DagsHub Repository