Srujana Manigonda, 2024. "Scaling Enterprise Data Systems for Complex Reporting and Analytics at the Enterprise Level" ESP International Journal of Advancements in Computational Technology (ESP-IJACT) Volume 2, Issue 2: 125-132.
In a world where data grows exponentially, enterprises often struggle with managing disparate data sources, leading to confusion over which sources to trust, inefficiencies in analysis, and time-consuming decision-making processes. This paper introduces the development of a unified data pipeline designed to standardize and integrate data from multiple sources into a single source of truth. By implementing this solution, organizations can support tech-based ad-hoc solutions, deliver consistent and accurate data to stakeholders, and foster collaboration across tech, product, and process teams. The pipeline enhances data accessibility, eliminates redundancy, ensures scalability, and supports real-time insights, enabling faster and better decision-making.
[1] H. Hu, Y. Wen, T. -S. Chua and X. Li, "Toward Scalable Systems for Big Data Analytics: A Technology Tutorial," in IEEE Access, vol. 2, pp. 652-687, 2014, doi: 10.1109/ACCESS.2014.2332453.
[2] Singu, Santosh Kumar. "Designing Scalable Data Engineering Pipelines Using Azure and Databricks." ESP Journal of Engineering & Technology Advancements 1.2 (2021): 176-187.
[3] J. Patel, "An Effective and Scalable Data Modeling for Enterprise Big Data Platform," 2019 IEEE International Conference on Big Data (Big Data), Los Angeles, CA, USA, 2019, pp. 2691-2697, doi: 10.1109/BigData47090.2019.9005614.
[4] Trom, L., Cronje, J. (2020). Analysis of Data Governance Implications on Big Data. In: Arai, K., Bhatia, R. (eds) Advances in Information and Communication. FICC 2019. Lecture Notes in Networks and Systems, vol 69. Springer, Cham. https://doi.org/10.1007/978-3-030-12388-8_45
[5] Ndamase, Zimasa. "The impact of data governance on corporate performance: the case of a petroleum company." (2014).
[6] Qingqiang Zhang, Xinbo Sun, Mingchao Zhang, Data Matters: A Strategic Action Framework for Data Governance, Information & Management,Volume 59, Issue 4, 2022, 103642, ISSN 0378-7206, https://doi.org/10.1016/j.im.2022.103642.
[7] E. R. Sparks, S. Venkataraman, T. Kaftan, M. J. Franklin and B. Recht, "KeystoneML: Optimizing Pipelines for Large-Scale Advanced Analytics," 2017 IEEE 33rd International Conference on Data Engineering (ICDE), San Diego, CA, USA, 2017, pp. 535-546, doi: 10.1109/ICDE.2017.109.
[8] T. von Landesberger, D. W. Fellner and R. A. Ruddle, "Visualization System Requirements for Data Processing Pipeline Design and Optimization," in IEEE Transactions on Visualization and Computer Graphics, vol. 23, no. 8, pp. 2028-2041, 1 Aug. 2017, doi: 10.1109/TVCG.2016.2603178.
[9] H. P. Kothandapani, “Optimizing Financial Data Governance for Improved Risk Management and Regulatory Reporting in Data Lakes”, IJAMCA, vol. 12, no. 4, pp. 41–63, Apr. 2022.
[10] Ahmad Faizal and Nur Aisyah, “Innovative Approaches to Enterprise Database Performance: Leveraging Advanced Optimization Techniques for Scalability, Reliability, and High Efficiency in Large-Scale Systems”, SSRAML, vol. 7, no. 1, pp. 42–65, Jan. 2024.
Salesforce CRM Cloud, Digital Transformation, Cloud Computing, Customer Relationship Management.