« StatBot » : différence entre les versions

De Wikipast
Aller à la navigation Aller à la recherche
Ligne 33 : Ligne 33 :


==Work Plan==
==Work Plan==
# Get birthdate from page
:: Load up the page of the entity
:: Get "Naissance" event date
:: Retrurn error code if the naissance event does not follow the correct syntaxe
# Add birthdate to the database file
:: Columns day month year for name

Version du 9 avril 2019 à 12:47

Mission Statement

Datapoints on named entities

  1. Birth year
  2. Life expectancy
    1. Add to entity page?
  3. Gender (maybe gender probability)
    1. Use name-gender bias
    2. Add to the entity page?
  4. Travel distance
    1. Solid angle encomposing all locations?
    2. Shortest path distance summed between locations specified in chronological order?
    3. Correlate location data with actual points in google maps?
  5. Profession
    1. Where can this information be retrieved?

Correlated Datapoints

  1. Relation score between two entities
    1. Using life time span, location overlap, referenced pages

Analysis

  1. Birth year distribution of the Wikipast database
  2. life expectancy of members in Wikipast
  3. Life expectancy at birth from Wikipast data
  4. Travel radius as function of Birth year
  5. Mean degrees of seperation between entities of Wikipast

Technicalities

Should the datapoints be stored in a page here on wikipast to be read from for analytics, or should it be kept in a local file? Should a biometrics section be added to the pages in order to store this gender data and other parameters?

Work Plan

  1. Get birthdate from page
Load up the page of the entity
Get "Naissance" event date
Retrurn error code if the naissance event does not follow the correct syntaxe
  1. Add birthdate to the database file
Columns day month year for name