Αυτόματη άντληση, μετασχηματισμός και απεικόνιση επιχειρηματικών δεδομένων με τεχνολογίες επιχειρηματικής ευφυΐας (business intelligence)
Automatic extraction, transformation, and visualization of business data with business intelligence technologies

View/ Open
Keywords
Business intelligence ; EDGAR ; Airbyte ; dbt ; Power BI ; ELT ; Quarter Bucket ; IBCSAbstract
In this undergraduate thesis, the design and implementation of an automated system for the extraction, transformation, and visualization of business data is presented, using Business Intelligence (BI) technologies. The primary data source is the public Securities and Exchange Commission's Electronic Data Gathering, Analysis, and Retrieval (SEC EDGAR) repository. In contrast, the overall data pipeline is implemented using open-source tools, specifically Airbyte (for data extraction and ingestion), PostgreSQL (for data storage), a data build tool (dbt) (for data transformation and processing), and Power BI (for reporting and interactive dashboards). To address issues related to temporal gaps, data asynchrony, and delayed financial disclosures, the “Quarter Bucket” methodology is applied, enabling fair and consistent comparisons across calendar quarters. The results of the study demonstrate the effectiveness of the proposed architecture in terms of identity stability (IDs), performance improvement through incremental/merge strategies, and the production of analytical reports that adhere to the International Business Communication Standards (IBCS), incorporating neutrality principles for the treatment of small variations.


