Skip to content

Commit ef699fe

Browse files
authored
Merge pull request #15 from MicrosoftCloudEssentials-LearningHub/presidio-notes
Update README with PII management references
2 parents 605f6a7 + 80158e3 commit ef699fe

File tree

13 files changed

+35
-29
lines changed

13 files changed

+35
-29
lines changed

0_RAG_Overview.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -147,7 +147,7 @@ graph LR
147147

148148
<!-- START BADGE -->
149149
<div align="center">
150-
<img src="https://img.shields.io/badge/Total%20views-1344-limegreen" alt="Total views">
151-
<p>Refresh Date: 2025-09-09</p>
150+
<img src="https://img.shields.io/badge/Total%20views-1428-limegreen" alt="Total views">
151+
<p>Refresh Date: 2025-09-18</p>
152152
</div>
153153
<!-- END BADGE -->

1_PrivateArchitecture.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -289,7 +289,7 @@ graph TD
289289

290290
<!-- START BADGE -->
291291
<div align="center">
292-
<img src="https://img.shields.io/badge/Total%20views-1344-limegreen" alt="Total views">
293-
<p>Refresh Date: 2025-09-09</p>
292+
<img src="https://img.shields.io/badge/Total%20views-1428-limegreen" alt="Total views">
293+
<p>Refresh Date: 2025-09-18</p>
294294
</div>
295295
<!-- END BADGE -->

2_ZeroTrustOverview.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -82,7 +82,7 @@ From [Zero Trust Architecture Deployment](https://github.com/Azure/GPT-RAG?tab=r
8282

8383
<!-- START BADGE -->
8484
<div align="center">
85-
<img src="https://img.shields.io/badge/Total%20views-1344-limegreen" alt="Total views">
86-
<p>Refresh Date: 2025-09-09</p>
85+
<img src="https://img.shields.io/badge/Total%20views-1428-limegreen" alt="Total views">
86+
<p>Refresh Date: 2025-09-18</p>
8787
</div>
8888
<!-- END BADGE -->

3_PricingDemos/0_Doc-Intellig.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -57,7 +57,7 @@ Last updated: 2025-08-18
5757

5858
<!-- START BADGE -->
5959
<div align="center">
60-
<img src="https://img.shields.io/badge/Total%20views-1344-limegreen" alt="Total views">
61-
<p>Refresh Date: 2025-09-09</p>
60+
<img src="https://img.shields.io/badge/Total%20views-1428-limegreen" alt="Total views">
61+
<p>Refresh Date: 2025-09-18</p>
6262
</div>
6363
<!-- END BADGE -->

3_PricingDemos/1_AI-Search.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -66,7 +66,7 @@ Last updated: 2025-08-18
6666

6767
<!-- START BADGE -->
6868
<div align="center">
69-
<img src="https://img.shields.io/badge/Total%20views-1344-limegreen" alt="Total views">
70-
<p>Refresh Date: 2025-09-09</p>
69+
<img src="https://img.shields.io/badge/Total%20views-1428-limegreen" alt="Total views">
70+
<p>Refresh Date: 2025-09-18</p>
7171
</div>
7272
<!-- END BADGE -->

3_PricingDemos/2_AOAI-AIFoundry.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -169,7 +169,7 @@ This means with 50 PTUs, you can process 75,000 tokens per minute.
169169

170170
<!-- START BADGE -->
171171
<div align="center">
172-
<img src="https://img.shields.io/badge/Total%20views-1344-limegreen" alt="Total views">
173-
<p>Refresh Date: 2025-09-09</p>
172+
<img src="https://img.shields.io/badge/Total%20views-1428-limegreen" alt="Total views">
173+
<p>Refresh Date: 2025-09-18</p>
174174
</div>
175175
<!-- END BADGE -->

3_PricingDemos/README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -138,7 +138,7 @@ graph TD
138138
139139
<!-- START BADGE -->
140140
<div align="center">
141-
<img src="https://img.shields.io/badge/Total%20views-1344-limegreen" alt="Total views">
142-
<p>Refresh Date: 2025-09-09</p>
141+
<img src="https://img.shields.io/badge/Total%20views-1428-limegreen" alt="Total views">
142+
<p>Refresh Date: 2025-09-18</p>
143143
</div>
144144
<!-- END BADGE -->

4_Which_Copilot_toChoose.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -156,7 +156,7 @@ From [Cloud computing - Service Models: SAAS, PAAS, IAAS - Which Is Better For B
156156
157157
<!-- START BADGE -->
158158
<div align="center">
159-
<img src="https://img.shields.io/badge/Total%20views-1344-limegreen" alt="Total views">
160-
<p>Refresh Date: 2025-09-09</p>
159+
<img src="https://img.shields.io/badge/Total%20views-1428-limegreen" alt="Total views">
160+
<p>Refresh Date: 2025-09-18</p>
161161
</div>
162162
<!-- END BADGE -->

README.md

Lines changed: 11 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -7,14 +7,19 @@ Costa Rica
77
[![GitHub](https://img.shields.io/badge/--181717?logo=github&logoColor=ffffff)](https://github.com/)
88
[brown9804](https://github.com/brown9804)
99

10-
Last updated: 2025-09-09
10+
Last updated: 2025-09-18
1111

1212
----------
1313

1414
<details>
1515
<summary><b>List of References</b> (Click to expand)</summary>
1616

1717
- [GPT-RAG Orchestrator](https://github.com/Azure/gpt-rag-orchestrator) - also in charge of sql on prem/Sharepoint
18+
- `How we managed PII?`
19+
- [Detect and redact Personally Identifying Information in text](https://learn.microsoft.com/en-us/azure/ai-services/language-service/personally-identifiable-information/how-to/redact-text-pii): If you're building on existing solutions and want to add layers via APIs or SDKs, consider using Azure AI language tools to identify and extract Personally Identifiable Information (PII).
20+
- [Presidio - Data Protection and De-identification SDK](https://github.com/microsoft/presidio): When you need an `open-source option for detecting and managing sensitive data, Presidio` is a great fit. It can be `integrated into your systems to detect, redact, mask, and anonymize PII across text, images, and structured data.`
21+
- [How data is protected and audited in Microsoft 365 and Microsoft 365 Copilot](https://learn.microsoft.com/en-us/copilot/microsoft-365/microsoft-365-copilot-architecture-data-protection-auditing): If you're working with Microsoft 365 Copilot and need to understand its data protection model, this overview explains how it handles security and compliance.
22+
- [Considerations to manage Microsoft 365 Copilot and Channel Agent in Teams for security and compliance](https://learn.microsoft.com/en-us/purview/ai-m365-copilot-considerations): When managing compliance across Copilot deployments, Microsoft Purview offers tools to help govern data securely.
1823
- [Azure Developer CLI commands overview](https://learn.microsoft.com/en-us/azure/developer/azure-developer-cli/azd-commands)
1924
- [Azure Developer CLI (azd)](https://learn.microsoft.com/en-us/azure/developer/azure-developer-cli/)
2025
- [What is PowerShell?](https://learn.microsoft.com/en-us/powershell/scripting/overview?view=powershell-7.5)
@@ -49,6 +54,7 @@ Last updated: 2025-09-09
4954

5055
> [!NOTE]
5156
> How to query from `Sharepoint Library`: [GPT-RAG Data Ingestion](https://github.com/Azure/gpt-rag-ingestion/tree/main) <br/>
57+
>
5258
> - Access & Authentication: Integration uses a `service principal accoun`t registered in Azure Entra ID to authenticate and access the SharePoint document library via Microsoft Graph API. This avoids using personal accounts for programmatic access.
5359
> - Data Ingestion Flow: The RAG system connects to the SharePoint library using the provided credentials, retrieves documents (mainly PDFs), and processes them for indexing.
5460
> - Code Structure: Key integration logic resides in files such as:
@@ -57,8 +63,8 @@ Last updated: 2025-09-09
5763
> - For each file: download, check if changed, chunk, and index.
5864
> - Tracks total, succeeded, and failed counts, and logs progress every N files.
5965
> - Uses in-memory tracking of failures for review.
60-
> - Handles API interaction: [sharePoint.py](https://github.com/Azure/gpt-rag-ingestion/blob/main/tools/sharepoint.py)
61-
> - Documents are chunked using a `common logic module, not specific to SharePoint`, to prepare data for embedding and indexing: [document_chunking.py](https://github.com/Azure/gpt-rag-ingestion/blob/main/chunking/document_chunking.py)
66+
> - Handles API interaction: [sharePoint.py](https://github.com/Azure/gpt-rag-ingestion/blob/main/tools/sharepoint.py)
67+
> - Documents are chunked using a `common logic module, not specific to SharePoint`, to prepare data for embedding and indexing: [document_chunking.py](https://github.com/Azure/gpt-rag-ingestion/blob/main/chunking/document_chunking.py)
6268
> - Current Limitation: Only one `SharePoint library is supported at a time, but the code can be extended to support multiple libraries by creating additional indexes.`
6369
> - No Preview Connector: The integration `does not use the AI Foundry SharePoint preview connector; it relies on custom code developed before that connector was available.`
6470
> - Component Architecture: The `ingestion is handled by a function app, which takes files from SharePoint, converts them into vectors/embeddings, and stores them in the index for search`.
@@ -453,7 +459,7 @@ gpt-rag-resource-group: resource not found: 0 resource groups with prefix or suf
453459
454460
<!-- START BADGE -->
455461
<div align="center">
456-
<img src="https://img.shields.io/badge/Total%20views-1344-limegreen" alt="Total views">
457-
<p>Refresh Date: 2025-09-09</p>
462+
<img src="https://img.shields.io/badge/Total%20views-1428-limegreen" alt="Total views">
463+
<p>Refresh Date: 2025-09-18</p>
458464
</div>
459465
<!-- END BADGE -->

how-modify.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -16,7 +16,7 @@ Last updated: 2025-08-18
1616
1717
<!-- START BADGE -->
1818
<div align="center">
19-
<img src="https://img.shields.io/badge/Total%20views-1344-limegreen" alt="Total views">
20-
<p>Refresh Date: 2025-09-09</p>
19+
<img src="https://img.shields.io/badge/Total%20views-1428-limegreen" alt="Total views">
20+
<p>Refresh Date: 2025-09-18</p>
2121
</div>
2222
<!-- END BADGE -->

0 commit comments

Comments
 (0)