
Schema & Key Analytics

Your schema, under surveillance. DataHarbor continuously monitors the shape of your data: it learns each Virtual API's schema over time and automatically surfaces emerging risks.

How it works

As Virtual APIs are used, DataHarbor:
  1. Learns schema shape over time
  2. Detects net-new fields as they appear
  3. Flags uncovered keys that may contain sensitive data
  4. Identifies risky combinations that enable re-identification
  5. Surfaces compliance violations

Net-new field detection

When your upstream API adds a new field, DataHarbor detects it and raises an alert like this:
{
  "alert": "new_field_detected",
  "virtual_api": "partner-view",
  "field": "emergency_contact_phone",
  "first_seen": "2025-01-15T10:30:00Z",
  "sample_value_type": "string",
  "suggested_action": "review_for_pii"
}
New fields are not automatically protected. You receive an alert to review and apply appropriate controls.
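
The detection step can be pictured as a set difference between the fields already seen and the fields in the latest response. The following is a minimal sketch under that assumption, not DataHarbor's implementation: `SchemaTracker`, `observe`, and the `baseline` flag are hypothetical names, and only the alert keys are taken from the example above.

```python
from datetime import datetime, timezone

# Map Python types to the JSON type names used in "sample_value_type".
JSON_TYPE_NAMES = {
    str: "string", bool: "boolean", int: "number", float: "number",
    list: "array", dict: "object", type(None): "null",
}


class SchemaTracker:
    """Hypothetical tracker: remembers which fields each Virtual API has returned."""

    def __init__(self):
        self.known_fields = {}  # virtual_api name -> set of field names seen so far

    def observe(self, virtual_api, record, now=None, baseline=False):
        """Record the fields in `record`; return one alert per field not seen before.

        With baseline=True the fields are learned silently (no alerts), which
        stands in for an initial learning period.
        """
        now = now or datetime.now(timezone.utc)
        seen = self.known_fields.setdefault(virtual_api, set())
        alerts = []
        for field, value in record.items():
            if field in seen:
                continue
            seen.add(field)
            if baseline:
                continue
            alerts.append({
                "alert": "new_field_detected",
                "virtual_api": virtual_api,
                "field": field,
                "first_seen": now.strftime("%Y-%m-%dT%H:%M:%SZ"),
                "sample_value_type": JSON_TYPE_NAMES.get(type(value), "object"),
                "suggested_action": "review_for_pii",
            })
        return alerts
```

Each field alerts only once per Virtual API: after the first sighting it is part of the known schema, so the alert marks the moment a review is needed rather than repeating on every response.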

Uncovered key detection

DataHarbor flags fields that may contain sensitive data but have no controls applied:
{
  "alert": "uncovered_sensitive_field",
  "virtual_api": "partner-view",
  "field": "drivers_license",
  "confidence": 0.92,
  "reason": "Field name matches known PII pattern",
  "suggested_control": "redact"
}
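
The "reason" in the alert above suggests matching on field names. As an illustration only, the sketch below compares observed field names against a small pattern table and skips fields that already have a control. The patterns, confidence values, and the `tokenize` control name are assumptions made for this example; how DataHarbor actually scores fields is not documented here.

```python
import re

# Hypothetical pattern table: (name pattern, confidence, suggested control).
# All values are illustrative, not DataHarbor's real rules.
PII_NAME_PATTERNS = [
    (re.compile(r"(^|_)ssn(_|$)|social_security"), 0.95, "redact"),
    (re.compile(r"drivers?_licen[sc]e"), 0.92, "redact"),
    (re.compile(r"(^|_)email(_|$)"), 0.85, "tokenize"),
]


def find_uncovered(virtual_api, observed_fields, controlled_fields):
    """Return alerts for observed fields that look sensitive but have no control."""
    alerts = []
    for field in sorted(set(observed_fields) - set(controlled_fields)):
        name = field.lower()
        for pattern, confidence, control in PII_NAME_PATTERNS:
            if pattern.search(name):
                alerts.append({
                    "alert": "uncovered_sensitive_field",
                    "virtual_api": virtual_api,
                    "field": field,
                    "confidence": confidence,
                    "reason": "Field name matches known PII pattern",
                    "suggested_control": control,
                })
                break  # first matching pattern wins
    return alerts
```

Name matching is cheap but says nothing about a field's values, which is one reason to treat the confidence as a prompt for review rather than a verdict.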

Risky combination detection

Some field combinations enable re-identification even when individual fields seem safe:
{
  "alert": "risky_combination",
  "virtual_api": "analytics-view",
  "fields": ["zip_code", "date_of_birth", "gender"],
  "risk": "high",
  "reason": "Combination can identify 87% of US population",
  "reference": "Sweeney 2000",
  "suggested_action": "anonymize_or_generalize"
}
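
One way to picture this check is as a subset test: a rule fires when every field in a known quasi-identifier set is exposed by the same Virtual API. The sketch below assumes a hypothetical rule table (`RISKY_COMBINATIONS`) holding only the combination from the alert above; the function name and rule format are illustrative.

```python
# Hypothetical rule table; the single entry mirrors the example alert.
RISKY_COMBINATIONS = [
    {
        "fields": frozenset({"zip_code", "date_of_birth", "gender"}),
        "risk": "high",
        "reference": "Sweeney 2000",
        "suggested_action": "anonymize_or_generalize",
    },
]


def find_risky_combinations(virtual_api, exposed_fields):
    """Return one alert per rule whose fields are all exposed together."""
    exposed = set(exposed_fields)
    alerts = []
    for rule in RISKY_COMBINATIONS:
        if rule["fields"] <= exposed:  # subset test: every field in the rule is present
            alerts.append({
                "alert": "risky_combination",
                "virtual_api": virtual_api,
                "fields": sorted(rule["fields"]),
                "risk": rule["risk"],
                "reference": rule["reference"],
                "suggested_action": rule["suggested_action"],
            })
    return alerts
```

Passing only the fields that are still exposed in raw form (for example, excluding a `date_of_birth` already generalized to a year) keeps the check from firing on combinations that have been mitigated.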

Compliance detection

DataHarbor automatically detects potential compliance violations:

Regulation        | Detection
GDPR Article 4(1) | Personal data exposed without consent basis
CCPA              | California resident PII without disclosure
HIPAA             | PHI fields without BAA coverage

{
  "alert": "compliance_risk",
  "virtual_api": "partner-view",
  "regulation": "GDPR",
  "article": "4(1)",
  "issue": "email field exposed to non-EU processor",
  "suggested_action": "apply_tokenization_or_consent"
}

Dashboard view

The Schema Analytics dashboard shows:
  • Schema coverage: Percentage of fields with controls applied
  • Risk score: Aggregate risk based on uncovered and risky fields
  • Field inventory: Complete list of observed fields with control status
  • Alert history: Timeline of detected issues
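
Schema coverage, as described above, is the share of observed fields that have a control applied. The sketch below computes that figure and a simple inventory from a mapping of field names to controls. The function names, the mapping format, and the handling of an empty schema are assumptions; the risk score is left out because its formula is not described here.

```python
def schema_coverage(field_controls):
    """Percentage of observed fields that have a control applied.

    field_controls maps field name -> control name, or None when uncovered.
    """
    if not field_controls:
        return 0.0  # nothing observed yet; this choice is an assumption
    covered = sum(1 for control in field_controls.values() if control is not None)
    return round(100.0 * covered / len(field_controls), 1)


def field_inventory(field_controls):
    """Sorted (field, status) rows, with uncovered fields labeled explicitly."""
    return [(name, control or "uncovered")
            for name, control in sorted(field_controls.items())]
```

For a Virtual API with four observed fields, two of which have controls, `schema_coverage` returns 50.0.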

Configuring alerts

monitoring:
  alerts:
    new_fields: true
    uncovered_sensitive: true
    risky_combinations: true
    compliance_violations: true
  
  notifications:
    email:
      - security@yourcompany.com
    webhook:
      url: https://yourcompany.com/webhooks/dataharbor
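
On the receiving side, the webhook URL needs an endpoint that accepts alerts. The sketch below uses Python's standard `http.server` and assumes, without confirmation from this page, that DataHarbor sends each alert as a JSON POST body shaped like the examples above. `summarize_alert`, `AlertHandler`, `serve`, and the port are hypothetical.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer

# Alert types that appear in the examples on this page.
KNOWN_ALERTS = {
    "new_field_detected", "uncovered_sensitive_field",
    "risky_combination", "compliance_risk",
}


def summarize_alert(body):
    """Turn a raw JSON alert body into a one-line summary; raise ValueError if invalid."""
    alert = json.loads(body)  # json.JSONDecodeError is a subclass of ValueError
    if not isinstance(alert, dict) or alert.get("alert") not in KNOWN_ALERTS:
        raise ValueError("unrecognized alert payload")
    details = alert.get("fields") or [alert.get("field") or alert.get("issue") or "?"]
    return f"[{alert['alert']}] {alert.get('virtual_api', '?')}: {', '.join(details)}"


class AlertHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        try:
            print(summarize_alert(self.rfile.read(length)))
            self.send_response(204)  # accepted, no response body
        except ValueError:
            self.send_response(400)  # malformed or unknown payload
        self.end_headers()


def serve(port=8080):
    """Run the receiver until interrupted (assumed local test setup)."""
    HTTPServer(("127.0.0.1", port), AlertHandler).serve_forever()
```

Calling `serve()` starts the listener. A production endpoint would also need TLS and some way to verify that a request really came from DataHarbor, neither of which is covered here.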
