
Mi Gente Event Data Dictionary v1

This document is the repository copy of the Mi Gente Data Dictionary and the authoritative mapping for workers/playwright-source-scanner/supabase/migrations/20260530000000_migente_event_data_dictionary_v1.sql.

Source-of-truth scope

  • Canonical tables: public.migente_events, public.migente_event_editions, public.migente_venues, public.migente_social_links, public.migente_tickets, public.migente_hotels, public.migente_competitions, public.migente_artists, public.migente_djs, public.migente_schedule_summary, public.migente_event_characteristics, public.migente_cost_estimates, public.migente_passport, public.migente_achievements, public.migente_event_dna.
  • Seed file: workers/playwright-source-scanner/supabase/seed.sql.
  • Validation output for service-role-only operational use: public.migente_worldwide_congress_candidate_view.
  • Public page output flows through the unified public.migente_public_routes_view, public.migente_public_listings_view, and public.migente_public_listing_details_view projections after canonical promotion.

Implementation alignment and assumptions

  1. is_core_corpus_candidate on migente_events and is_core_corpus_eligible on migente_event_editions together define the active global corpus filter.
  2. Core corpus inclusion rule is enforced in SQL:
    • event type must be one of: Congress, Weekender, Festival, Marathon, Retreat, Cruise, Dance Vacation
    • edition must be explicitly marked eligible
    • eligible edition must have non-null start_date and end_date and be multiday (end_date > start_date)
    • bootcamps are excluded unless governance rules are changed later
  3. Candidate scope is designed for multiday, internationally useful congress-family events. Single-day bootcamps are kept for archival research but not included in world corpus scoring.
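
The inclusion rule in item 2 can be read as a single predicate over an edition row. The following is a minimal Python sketch of that reading, not the SQL in the migration. The Edition class and its event_type field are hypothetical names introduced for illustration; is_core_corpus_eligible, start_date, and end_date are taken from the text above. It assumes event types are compared as exact, case-sensitive strings, and it does not model the event-level is_core_corpus_candidate flag.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional

# Event types named in the core corpus inclusion rule.
CORE_EVENT_TYPES = frozenset({
    "Congress", "Weekender", "Festival", "Marathon",
    "Retreat", "Cruise", "Dance Vacation",
})


@dataclass(frozen=True)
class Edition:
    """Hypothetical, flattened view of one event edition."""
    event_type: str                # assumed to come from the parent event
    is_core_corpus_eligible: bool  # explicit eligibility mark on the edition
    start_date: Optional[date]
    end_date: Optional[date]


def in_core_corpus(edition: Edition) -> bool:
    """Return True when the edition satisfies all three stated conditions."""
    # Condition 1: event type must be in the allowed list
    # (a "Bootcamp" type is therefore excluded).
    if edition.event_type not in CORE_EVENT_TYPES:
        return False
    # Condition 2: the edition must be explicitly marked eligible.
    if not edition.is_core_corpus_eligible:
        return False
    # Condition 3: both dates must be present and the edition must span
    # more than one day (end_date strictly after start_date).
    if edition.start_date is None or edition.end_date is None:
        return False
    return edition.end_date > edition.start_date
```

Under these assumptions, a Congress edition that runs from 1 May to 3 May and is marked eligible passes the check, while a single-day edition or one with a missing date does not.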

Required follow-up

  • Add missing actor-level provenance for source_url and last_verified_date updates.
  • Add phase-2 join tables for artist/DJ appearances before enabling recommendations.
  • Add recommendation engine and analytics artifacts as Phase-3 work, consistent with this schema baseline.