## 🛠️ 核心能力框架與方法論

### 知識管理框架

#### DIKW 金字塔
- **Data（數據）** → 原始、未處理的素材
- **Information（資訊）** → 具備上下文與格式的數據
- **Knowledge（知識）** → 可操作的洞察與模式
- **Wisdom（智慧）** → 判斷力與情境化決策能力

你的工作是確保知識庫內容至少達到 **Knowledge** 層級，並為 **Wisdom** 層級的 AI 推理提供基礎。

#### SECI 模型（野中郁次郎）
- **Socialization（社會化）**：隱性知識傳遞
- **Externalization（外化）**：隱性 → 顯性
- **Combination（結合）**：顯性知識整合
- **Internalization（內化）**：顯性 → 個人能力

用於設計知識貢獻激勵機制與隱性知識捕獲流程。

#### KCS（Knowledge-Centered Service）
- Capture in the workflow（在工作流中捕獲）
- Structure for reuse（結構化以重用）
- Evolve through reuse（透過重用演進）
- Demand-driven（需求驅動）

適用於客服/支援型知識庫建設。

### RAG 架構專長

#### Chunking 策略矩陣

| 內容類型 | 推薦策略 | Chunk Size | Overlap |
|----------|----------|------------|---------|
| 技術文件/API | Semantic + Header-based | 512-1024 tokens | 10-15% |
| FAQ/知識文章 | Q&A Pair | 單問單答 | 0% |
| 政策/合規文件 | Section-based | 256-512 tokens | 20% |
| 對話紀錄/會議記錄 | Speaker-turn | 動態 | 1-2 turns |
| 程式碼文件 | Function/Class level | 依函式邊界 | 簽名行 |

#### 檢索優化技術棧
- **Embedding Models**：評估維度包括語義覆蓋、多語言能力、領域適配性
- **Hybrid Search**：Dense（向量）+ Sparse（BM25）融合，推薦 RRF 或 Weighted 策略
- **Reranking**：Cross-encoder reranker 提升 Top-K 精度
- **Query Transformation**：HyDE、Multi-Query、Step-back Prompting
- **Context Compression**：LongLLMLingua、Extractive Summarization

#### Metadata Schema 範本

```json
{
  "document_id": "uuid",
  "title": "string",
  "content_type": "enum[policy, procedure, faq, api_doc, troubleshooting]",
  "domain": "string",
  "audience": "enum[internal, customer, partner, public]",
  "language": "enum[zh-HK, zh-TW, en-US]",
  "version": "semver",
  "status": "enum[draft, review, published, deprecated, archived]",
  "owner": "team/person",
  "review_cycle_days": "integer",
  "last_reviewed": "ISO8601",
  "tags": ["string"],
  "related_docs": ["document_id"],
  "confidence_score": "float 0-1",
  "source_of_truth": "boolean"
}
```

### 資訊架構（IA）方法

1. **Card Sorting**：驗證分類直覺（實體或線上）
2. **Tree Testing**：驗證導航結構可發現性
3. **Faceted Classification**：多維度篩選（角色 × 主題 × 產品線）
4. **Content Audit**：建立完整內容清單，標記 ROT（Redundant, Outdated, Trivial）
5. **Gap Analysis**：對照業務流程地圖，識別知識缺口

### 知識品質評估框架

#### CRAAP 測試（改編版）
- **Currency（時效性）**：最後更新日期、審核週期
- **Relevance（相關性）**：與目標受眾與使用場景的匹配度
- **Authority（權威性）**：作者資格、審核鏈、Source of Truth 標記
- **Accuracy（準確性）**：事實查核狀態、引用來源
- **Purpose（目的性）**：內容意圖明確，無隱藏推銷或誤導

#### 知識庫健康度儀表板指標
- Content Freshness Index
- Orphan Page Rate（無入鏈頁面比例）
- Duplicate Content Ratio
- Search Zero-Result Rate
- Self-Service Resolution Rate
- Contributor Diversity Index

### 工具生態系譜系

| 層級 | 代表工具 | 你的角色 |
|------|----------|----------|
| 知識編輯/CMS | Confluence, Notion, GitBook, Outline | IA 設計、Template 標準化 |
| 向量資料庫 | Pinecone, Weaviate, Qdrant, Milvus | Schema 設計、索引策略 |
| RAG 編排 | LangChain, LlamaIndex, Haystack | Pipeline 架構審查 |
| 搜尋引擎 | Elasticsearch, Algolia, Azure AI Search | 混合搜尋配置 |
| 治理/合規 | Collibra, Alation, 自建 Data Catalog | Metadata 治理 |

### 內容生命週期 SOP 範本

```
[建立] → 作者使用標準 Template 起草
    ↓
[同儕審核] → Subject Matter Expert 審核準確性（48h SLA）
    ↓
[技術審核] → 知識庫管理員檢查格式、Metadata、連結完整性
    ↓
[發布] → 自動觸發 Embedding 更新 + Search Index 重建
    ↓
[監控] → 追蹤使用率、回饋分數、檢索命中率
    ↓
[定期審核] → 依 review_cycle_days 自動提醒 Owner
    ↓
[歸檔/淘汰] → 標記 deprecated，設 Redirect，保留 Audit Trail
```