"StatBot": difference between revisions

From Wikipast



Revision as of 9 April 2019, 13:03

Mission Statement

Datapoints on named entities

  1. Birth year
  2. Life expectancy
    1. Add to entity page?
  3. Gender (maybe gender probability)
    1. Use name-gender bias
    2. Add to the entity page?
  4. Travel distance
    1. Solid angle encompassing all locations?
    2. Shortest path distance summed between locations specified in chronological order?
    3. Correlate location data with actual points in Google Maps?
  5. Profession
    1. Where can this information be retrieved?
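
For the travel distance datapoint, the second option (shortest path distance summed between locations in chronological order) could be sketched as follows. This is a minimal illustration, not StatBot's actual code: the function names, the `(date, (lat, lon))` event format, and the use of the haversine great-circle formula as the "shortest path" are all assumptions.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def haversine_km(a, b):
    """Great-circle distance in km between two (lat, lon) points given in degrees."""
    lat1, lon1 = map(math.radians, a)
    lat2, lon2 = map(math.radians, b)
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(h))


def travel_distance_km(events):
    """Sum the leg distances between consecutive locations, ordered by date.

    `events` is a list of (date, (lat, lon)) pairs (hypothetical format);
    dates only need to be sortable, e.g. "1879.03.14" strings.
    """
    ordered = [coords for _, coords in sorted(events, key=lambda e: e[0])]
    return sum(haversine_km(p, q) for p, q in zip(ordered, ordered[1:]))
```

An entity with zero or one located event gets a travel distance of 0 under this definition.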

Correlated Datapoints

  1. Relation score between two entities
    1. Using lifespan, location overlap, and referenced pages
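
One possible way to combine the three signals into a single score is sketched below. The page does not specify a formula, so everything here is an assumption: the entity dictionary layout (`span`, `places`, `links`), the use of Jaccard similarity for the set-valued signals, and the equal weighting of the three parts.

```python
def jaccard(a, b):
    """Jaccard similarity of two collections; 0.0 when both are empty."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def lifespan_overlap(span_a, span_b):
    """Shared years divided by the shorter lifespan; spans are (birth_year, death_year)."""
    shared = min(span_a[1], span_b[1]) - max(span_a[0], span_b[0])
    shorter = min(span_a[1] - span_a[0], span_b[1] - span_b[0])
    if shared <= 0 or shorter <= 0:
        return 0.0
    return shared / shorter


def relation_score(a, b):
    """Equal-weight average of the three signals (hypothetical weighting), in [0, 1]."""
    parts = [
        lifespan_overlap(a["span"], b["span"]),
        jaccard(a["places"], b["places"]),
        jaccard(a["links"], b["links"]),
    ]
    return sum(parts) / len(parts)
```

The weights would likely need tuning against pairs of entities whose relationship is already known.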

Analysis

  1. Birth year distribution of the Wikipast database
  2. Life expectancy of members in Wikipast
  3. Life expectancy at birth from Wikipast data
  4. Travel radius as a function of birth year
  5. Mean degrees of separation between entities of Wikipast
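
The mean degrees of separation could be estimated with a breadth-first search from every entity. The sketch below assumes the links between pages have already been collected into an undirected adjacency dictionary (`graph`, a hypothetical structure mapping each page name to the set of pages it is connected to), and it ignores pairs that are not connected at all.

```python
from collections import deque


def bfs_distances(graph, start):
    """Number of hops from `start` to every reachable node."""
    dist = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for neighbour in graph.get(node, ()):
            if neighbour not in dist:
                dist[neighbour] = dist[node] + 1
                queue.append(neighbour)
    return dist


def mean_separation(graph):
    """Average hop count over all ordered pairs of connected, distinct nodes.

    Returns None when no two nodes are connected.
    """
    total = pairs = 0
    for start in graph:
        for node, hops in bfs_distances(graph, start).items():
            if node != start:
                total += hops
                pairs += 1
    return total / pairs if pairs else None
```

Running one search per node costs roughly (nodes × edges) operations, which should be acceptable for a wiki of moderate size; sampling start nodes is an option if it is not.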

Technicalities

Should the datapoints be stored on a Wikipast page and read from there for analytics, or kept in a local file? Should a biometrics section be added to the entity pages to store the gender data and other parameters?

Work Plan

Get Naissance from page

Argument (page_name); returns (name, birthplace, birth day, month, year), using 0 for any value that is not present
Load the page of the entity
Get the "Naissance" event date (use the first hyperlinked word for the name)
Return an error code if the Naissance event does not follow the correct syntax
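
The parsing part of this step might look like the sketch below. It takes the page's wikitext directly rather than a page name, since loading the page is left out. Several things are assumptions rather than facts from this page: the event syntax `* [[YYYY.MM.DD]] / [[Place]]. [[Naissance]] de [[Name]].`, the reading of "first hyperlinked word" as the first link after `[[Naissance]]`, and the status-code constants.

```python
import re

# Matches [[target]] and [[target|label]], capturing the target only.
LINK = re.compile(r"\[\[([^\]|]+)(?:\|[^\]]*)?\]\]")
# Dates are assumed to look like 1879, 1879.03 or 1879.03.14.
DATE = re.compile(r"(\d{4})(?:\.(\d{1,2}))?(?:\.(\d{1,2}))?")

OK, NO_EVENT, BAD_SYNTAX = 0, 1, 2  # hypothetical status codes


def get_naissance(wikitext):
    """Return (status, record); record is (name, birthplace, day, month, year) or None.

    Missing birthplace, day, or month values are reported as 0.
    """
    for line in wikitext.splitlines():
        links = [target.strip() for target in LINK.findall(line)]
        if "Naissance" not in links:
            continue
        idx = links.index("Naissance")
        # The first link on the line must be the date, and a name must follow Naissance.
        date = DATE.fullmatch(links[0]) if idx > 0 else None
        if date is None or idx + 1 >= len(links):
            return BAD_SYNTAX, None
        year, month, day = (int(g) if g else 0 for g in date.groups())
        place = links[1] if idx > 1 else 0
        name = links[idx + 1]
        return OK, (name, place, day, month, year)
    return NO_EVENT, None
```

Only the first line containing a Naissance link is considered; a page with several such lines would need an extra rule to pick one.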

Database write

Arguments (page_name, dictionary('parameter', value)); returns (success or failure)
One column for each variable name
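
If the database ends up being a local file, one simple option is a CSV file with one row per page and one column per variable name. The sketch below is only an illustration of that idea: the CSV format, the `page_name` key column, and the `db_path` parameter are assumptions, and the whole file is rewritten on each call so that new columns can be added.

```python
import csv
import os


def write_record(db_path, page_name, params):
    """Insert or update the row for `page_name`; return True on success, False on failure."""
    try:
        rows, columns = {}, ["page_name"]
        if os.path.exists(db_path):
            with open(db_path, newline="", encoding="utf-8") as f:
                reader = csv.DictReader(f)
                columns = list(reader.fieldnames or columns)
                rows = {row["page_name"]: row for row in reader}
        # Add a column for every parameter name not seen before.
        columns += [key for key in params if key not in columns]
        rows.setdefault(page_name, {"page_name": page_name}).update(params)
        with open(db_path, "w", newline="", encoding="utf-8") as f:
            # restval fills the cells of rows that lack a newly added column.
            writer = csv.DictWriter(f, fieldnames=columns, restval="")
            writer.writeheader()
            writer.writerows(rows.values())
        return True
    except (OSError, KeyError, ValueError, csv.Error):
        return False
```

Rewriting the file keeps the sketch short but does not protect against two bots writing at once; a real database such as SQLite would be a safer choice if that matters.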

Page Write