Data Sync
Manage all three data source sync pipelines
Database Status
Total Projects
0
Last Synced
—
Manual Sync
webrobots.ioMonthly full snapshot · 2016 to present
Downloads the latest webrobots.io dataset (~100 MB compressed, ~2M records) and writes to local SQLite. Sync takes 5–15 minutes — do not close the app.
Auto-sync: checks webrobots daily at 4am and only downloads when a new dataset is available.
KS Live Discovery
kickstarter.comReal-time new project discovery · fills snapshot gaps
Scrapes the Kickstarter discover page (sorted by newest) to find projects launched after the last webrobots snapshot.
Auto-runs every 2 hours in the background, fetching projects from the last 3 days.
Kicktraq Active Projects
kicktraq.comActive project supplement · third-party source
Scrapes the Kicktraq active projects list to supplement KS direct scraping coverage.
Auto-runs every 6 hours in the background.
Kicktraq Full Scan (one-time)
full scanCrawl all category pages to capture all currently active projects
Iterates all ~130 Kicktraq categories, paginates through each, and deduplicates against existing records. Estimated runtime: 15–30 minutes.
⚠️ Note: /archive/ is disabled by Kicktraq. This scan only covers currently active projects (~2800). Historical data relies on webrobots monthly snapshots.
About the Data
- Source: webrobots.io monthly Kickstarter full snapshots
- Format: CSV (ZIP-compressed), 20+ fields per project
- History: March 2016 to present, one snapshot per month
- Auto-sync: checks for a new dataset on server startup and daily at 4am
- Database: Local SQLite at data/kickstarter.db
- KS Live: auto-runs every 2h to discover new projects from Kickstarter discover page
- Kicktraq: auto-runs every 6h, deduplicates against existing records before inserting