Skip to content

Commit d2e1bcc

Browse files
authored
Merge pull request #3 from MicrosoftCloudEssentials-LearningHub/modular-approach
Modular approach
2 parents a352f1e + 3bb7563 commit d2e1bcc

File tree

15 files changed

+1010
-505
lines changed

15 files changed

+1010
-505
lines changed

README.md

Lines changed: 12 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ Costa Rica
88
[![GitHub](https://img.shields.io/badge/--181717?logo=github&logoColor=ffffff)](https://github.com/)
99
[brown9804](https://github.com/brown9804)
1010

11-
Last updated: 2025-07-25
11+
Last updated: 2025-07-29
1212

1313
-----------------------------
1414

@@ -17,6 +17,8 @@ Last updated: 2025-07-25
1717
- Table structure and text are extracted using Azure Document Intelligence (Layout model).
1818
- Visual selection cues are detected using Azure AI Vision or image preprocessing.
1919
- Visual indicators are mapped to structured data, returning only the selected values in a clean JSON format.
20+
- Advanced semantic understanding is provided by Azure OpenAI to analyze document content and context.
21+
- Multiple file formats are supported, including PDFs and various image formats.
2022
- The logic is abstracted to support multiple layout variations, so the system adapts easily to new document formats and selection styles.
2123

2224
> [!IMPORTANT]
@@ -65,11 +67,14 @@ Last updated: 2025-07-25
6567

6668
</details>
6769

68-
> How to extract layout elements from PDFs stored in an Azure Storage Account, process them using Azure Document Intelligence, and store the results in Cosmos DB for further analysis.
70+
> `How can you extract layout, text, visual, and other elements` from `PDFs` stored in an Azure Storage Account, process them using Azure AI services, and `store the results` in Cosmos DB for `further analysis?` This solution is `designed to accelerate the process` of building your own implementation. Please `feel free to use any of the provided reference.` I'm happy to contribute. Once this solution is deployed:
6971
>
70-
> 1. Upload your PDFs to an Azure Blob Storage container. <br/>
71-
> 2. An Azure Function is triggered by the upload, which calls the Azure Document Intelligence Layout API to analyze the document structure. <br/>
72-
> 3. The extracted layout data (such as tables, checkboxes, and text) is parsed and subsequently stored in a Cosmos DB database, ensuring a seamless and automated workflow from document upload to data storage.
72+
> 1. Upload your documents: Just `drop your PDFs or images into an Azure Storage container`and the system takes over from there.
73+
> 2. Automated intelligent processing: Behind the scenes, `Azure Functions orchestrates a powerful AI workflow`:
74+
> - Document Intelligence pulls out tables, text, and form data
75+
> - AI Vision spots visual cues like checkmarks and highlights
76+
> - Azure OpenAI understands what the document actually means
77+
> 3. Centralized information management: `All extracted data is stored in Cosmos DB`, organized and accessible. The system `adapts to differents document layouts without requiring custom code for each format.`
7378
7479
> [!NOTE]
7580
> Advantages of Document Intelligence for organizations handling with large volumes of documents: <br/>
@@ -447,7 +452,7 @@ Last updated: 2025-07-25
447452

448453
<!-- START BADGE -->
449454
<div align="center">
450-
<img src="https://img.shields.io/badge/Total%20views-1447-limegreen" alt="Total views">
451-
<p>Refresh Date: 2025-07-25</p>
455+
<img src="https://img.shields.io/badge/Total%20views-1616-limegreen" alt="Total views">
456+
<p>Refresh Date: 2025-07-29</p>
452457
</div>
453458
<!-- END BADGE -->

0 commit comments

Comments
 (0)