IJIEEB Vol. 17, No. 6, 8 Dec. 2025
Cover page and Table of Contents: PDF (size: 984KB)
PDF (984KB), PP.71-80
Views: 0 Downloads: 0
Generative Adversarial Network, Machine Learning, One-shot Learning, Python, Synthetic Data Generation, Enterprise Applications
This research tackles the fundamental requirement of synthetic data generation to tighten up machine learning model precision and one-shot shot learning to lessen the need to pursue data input. The project aims to develop a service that can generate reasonable synthetic data from a given dataset. It was first designed and developed, and then the project structure was set, and libraries were chosen for predevelopment analysis. This continued development process also included subsequent phases that included dataset collecting, assessment, and iterative research. Different hyperparameters were run over multiple models to select an optimal configuration. To evaluate the model's performance over produced synthetic datasets, about 1.5 and 2 times the original, synthetic data was produced, providing a basis for a robust synthetic data generating process.
Sreevishnu A. B., Suman De, "Synthetic Data Generation Using Generative Adversarial Networks for Enterprise Application", International Journal of Information Engineering and Electronic Business(IJIEEB), Vol.17, No.6, pp. 71-80, 2025. DOI:10.5815/ijieeb.2025.06.06
[1]S. De and P. Mitra, "A Novel Technique of Synthetic Data Generation for Asset Administration Shells in Industry 4.0 Scenarios," in IEEE Transactions on Artificial Intelligence, doi: 10.1109/TAI.2024.3409516.
[2]Synthetic Data Vault (SDV), Version 0.17.2, Jan 24, 2023, Available: https://sdv.dev/SDV/index.html, Last Accessed: 29.01.2023.
[3]Friederich, J., Francis, D. P., Lazarova-Molnar, S., & Mohamed, N. (2022). A framework for data-driven digital twins of smart manufacturing systems. Computers in Industry, 136, 103586. https://doi.org/10.1016/j.compind.2021.103586
[4]Noseong Park, Mahmoud Mohammadi, Kshitij Gorde, Sushil Jajodia, Hongkyu Park, and Youngmin Kim. Data Synthesis based on Generative Adversarial Networks. PVLDB, 11 (10): 1071-1083, 2018. DOI: https://doi.org/10.14778/3231751.3231757
[5]Figueira, A.; Vaz, B. Survey on Synthetic Data Generation, Evaluation Methods and GANs. Mathematics 2022, 10, 2733. https://doi.org/10.3390/math10152733.
[6]Noseong Park, Mahmoud Mohammadi, Kshitij Gorde, Sushil Jajodia, Hongkyu Park, and Youngmin Kim. Data Synthesis based on Generative Adversarial Networks. PVLDB, 11 (10): 1071-1083, 2018. DOI: https://doi.org/10.14778/3231751.3231757.
[7]M. Naveen and S. De, "Efficient Digital Assistant Workflow for Ticket Monitoring and Dispatching," 2022 International Conference on Smart Generation Computing, Communication and Networking (SMART GENCON), Bangalore, India, 2022, pp. 1-6, doi: 10.1109/SMARTGENCON56628.2022.10083722.
[8]Figueira, A.; Vaz, B. Survey on Synthetic Data Generation, Evaluation Methods and GANs. Mathematics 2022, 10, 2733. https://doi.org/10.3390/math10152733.
[9]César Quilodrán-Casas, Vinicius L.S. Silva, Rossella Arcucci, Claire E. Heaney, YiKe Guo, Christopher C. Pain, Digital twins based on bidirectional LSTM and GAN for modelling the COVID-19 pandemic, Neurocomputing, Volume 470, 2022, Pages 11-28, ISSN 0925-2312, DOI: 10.1016/j.neucom.2021.10.043.
[10]Sanitago Gomez, “Demystifying the CTGAN Loss Function | Synthetic Data Modeling”, sdv-team, Github Repository, Available: https://github.com/sdv-dev/SDV/discussions/980, Last Accessed: 01.06.2023.
[11]Table Evaluator, Available: https://baukebrenninkmeijer.github.io/table-evaluator/, Last Accessed: 31.01.2023.
[12]Lei Xu, Maria Skoularidou, Alfredo Cuesta-Infante, Kalyan Veeramachaneni, “Modeling Tabular Data using Conditional GAN”, October 2019, Available: https://arxiv.org/pdf/1907.00503.pdf, Last Accessed: 01.06.2023.
[13]Sanitago Gomez, “Demystifying the CTGAN Loss Function | Synthetic Data Modeling”, sdv-team, Github Repository, Available: https://github.com/sdv-dev/SDV/discussions/980, Last Accessed: 01.06.2023.
[14]Table Evaluator, Available: https://baukebrenninkmeijer.github.io/table-evaluator/, Last Accessed: 31.01.2023.
[15]Lei Xu, Maria Skoularidou, Alfredo Cuesta-Infante, Kalyan Veeramachaneni, “Modeling Tabular Data using Conditional GAN”, October, 2019, Available: https://arxiv.org/pdf/1907.00503.pdf, Last Accessed: 01.06.2023.