Mi Gente Event Data Dictionary v1
This document is the repository copy of the Mi Gente Data Dictionary and the authoritative mapping for workers/playwright-source-scanner/supabase/migrations/20260530000000_migente_event_data_dictionary_v1.sql.
Source-of-truth scope
- Canonical tables:
public.migente_events,public.migente_event_editions,public.migente_venues,public.migente_social_links,public.migente_tickets,public.migente_hotels,public.migente_competitions,public.migente_artists,public.migente_djs,public.migente_schedule_summary,public.migente_event_characteristics,public.migente_cost_estimates,public.migente_passport,public.migente_achievements,public.migente_event_dna. - Seed file:
workers/playwright-source-scanner/supabase/seed.sql. - Validation output for service-role-only operational use:
public.migente_worldwide_congress_candidate_view. - Public page output flows through the unified
public.migente_public_routes_view,public.migente_public_listings_view, andpublic.migente_public_listing_details_viewprojections after canonical promotion.
Implementation alignment and assumptions
is_core_corpus_candidateonmigente_eventsandis_core_corpus_eligibleonmigente_event_editionsrepresent the active global corpus filter.- Core corpus inclusion rule is enforced in SQL:
- event type must be one of:
Congress,Weekender,Festival,Marathon,Retreat,Cruise,Dance Vacation - edition must be explicitly marked eligible
- eligible edition must be multiday (
end_date > start_date) and non-null - bootcamps are excluded unless governance rules are changed later.
- event type must be one of:
- Candidate scope is designed for multiday, internationally useful congress-family events. Single-day bootcamps are kept for archival research but not included in world corpus scoring.
Required follow-up
- Add missing actor-level provenance for
source_urlandlast_verified_dateupdates. - Add phase-2 join tables for artist/DJ appearances before enabling recommendations.
- Add recommendation engine and analytics artifacts as Phase-3 work, consistent with this schema baseline.