You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+9-4Lines changed: 9 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -17,6 +17,8 @@ Last updated: 2025-07-25
17
17
- Table structure and text are extracted using Azure Document Intelligence (Layout model).
18
18
- Visual selection cues are detected using Azure AI Vision or image preprocessing.
19
19
- Visual indicators are mapped to structured data, returning only the selected values in a clean JSON format.
20
+
- Advanced semantic understanding is provided by Azure OpenAI to analyze document content and context.
21
+
- Multiple file formats are supported, including PDFs and various image formats.
20
22
- The logic is abstracted to support multiple layout variations, so the system adapts easily to new document formats and selection styles.
21
23
22
24
> [!IMPORTANT]
@@ -65,11 +67,14 @@ Last updated: 2025-07-25
65
67
66
68
</details>
67
69
68
-
> How to extract layoutelements from PDFs stored in an Azure Storage Account, process them using Azure Document Intelligence, and store the results in Cosmos DB for further analysis.
70
+
> `How can you extract layout, text, visual, and other elements` from `PDFs` stored in an Azure Storage Account, process them using Azure AI services, and `store the results` in Cosmos DB for `further analysis?` This solution is `designed to accelerate the process` of building your own implementation. Please `feel free to use any of the provided reference.` I'm happy to contribute. Once this solution is deployed:
69
71
>
70
-
> 1. Upload your PDFs to an Azure Blob Storage container. <br/>
71
-
> 2. An Azure Function is triggered by the upload, which calls the Azure Document Intelligence Layout API to analyze the document structure. <br/>
72
-
> 3. The extracted layout data (such as tables, checkboxes, and text) is parsed and subsequently stored in a Cosmos DB database, ensuring a seamless and automated workflow from document upload to data storage.
72
+
> 1. Upload your documents: Just `drop your PDFs or images into an Azure Storage container `and the system takes over from there.
73
+
> 2. Automated intelligent processing: Behind the scenes, `Azure Functions orchestrates a powerful AI workflow`:
74
+
> - Document Intelligence pulls out tables, text, and form data
75
+
> - AI Vision spots visual cues like checkmarks and highlights
76
+
> - Azure OpenAI understands what the document actually means
77
+
> 3. Centralized information management: `All extracted data is stored in Cosmos DB`, organized and accessible. The system `adapts to differents document layouts without requiring custom code for each format.`
73
78
74
79
> [!NOTE]
75
80
> Advantages of Document Intelligence for organizations handling with large volumes of documents: <br/>
0 commit comments