« StatBot » : différence entre les versions

De Wikipast
Aller à la navigation Aller à la recherche
Ligne 37 : Ligne 37 :
:: Load up the page of the entity
:: Load up the page of the entity
:: Get "Naissance" event date (use first hyperword for name)
:: Get "Naissance" event date (use first hyperword for name)
:: Retrurn error code if the naissance event does not follow the correct syntaxe
:: Return error code if the naissance event does not follow the correct syntaxe
 
===Database write ===
===Database write ===
::: Arguments (page_name , dictionary('parameter', value)), Returns (success or failure)
::: Arguments (page_name , dictionary('parameter', value)), Returns (success or failure)
:: Columns for each variable name  
:: Columns for each variable name  
===Databas Read===
:: Check if the entry for page_name already exists and overwrite it
===Database Load column===
:::Arguments (Column name list), Returns nd list consisting of all the data in the order specified in column name list argument
===Database Read entry===
:::Arguments (page_name), returns dictionary of all properties from the entry number


===Page Write===
===Page Write===
In Statbot section of the entity page
In Statbot section of the entity page
:::
::: Arguments(page_name, property:value)
:: Check if existing and modify
:: If section as a whole does not exist then create it


===ll===
===ll===

Version du 9 avril 2019 à 19:38

Mission Statement

Datapoints on named entities

  1. Birth year
  2. Life expectancy
    1. Add to entity page?
  3. Gender (maybe gender probability)
    1. Use name-gender bias
    2. Add to the entity page?
  4. Travel distance
    1. Solid angle encomposing all locations?
    2. Shortest path distance summed between locations specified in chronological order?
    3. Correlate location data with actual points in google maps?
  5. Profession
    1. Where can this information be retrieved?

Correlated Datapoints

  1. Relation score between two entities
    1. Using life time span, location overlap, referenced pages

Analysis

  1. Birth year distribution of the Wikipast database
  2. life expectancy of members in Wikipast
  3. Life expectancy at birth from Wikipast data
  4. Travel radius as function of Birth year
  5. Mean degrees of seperation between entities of Wikipast

Technicalities

Should the datapoints be stored in a page here on wikipast to be read from for analytics, or should it be kept in a local file? Should a biometrics section be added to the pages in order to store this gender data and other parameters?

Work Plan

Get Naissance from page

Argument (page_name), Returns (Name, birthplace, birth day , month, year) use 0 if not present
Load up the page of the entity
Get "Naissance" event date (use first hyperword for name)
Return error code if the naissance event does not follow the correct syntaxe

Database write

Arguments (page_name , dictionary('parameter', value)), Returns (success or failure)
Columns for each variable name
Check if the entry for page_name already exists and overwrite it

Database Load column

Arguments (Column name list), Returns nd list consisting of all the data in the order specified in column name list argument

Database Read entry

Arguments (page_name), returns dictionary of all properties from the entry number

Page Write

In Statbot section of the entity page

Arguments(page_name, property:value)
Check if existing and modify
If section as a whole does not exist then create it

ll