Python库|trueskillthroughtime-0.0.1.tar.gz资源-CSDN文库

版权申诉

8 浏览量 2022-03-11 16:12:56 上传评论收藏 8KB GZ 举报

共10个文件

txt：4个

pkg-info：2个

license：1个

资源推荐

资源详情

资源评论

收起资源包目录

trueskillthroughtime-0.0.1.tar.gz （10个子文件）

trueskillthroughtime-0.0.1

PKG-INFO 13KB

LICENSE 2KB

src

trueskillthroughtime.egg-info

PKG-INFO 13KB

requires.txt 6B

SOURCES.txt 275B

top_level.txt 1B

dependency_links.txt 1B

setup.cfg 102B

setup.py 1KB

README.md 11KB

# TrueSkillThroughTime.py None of the commonly used skill estimators, such as TrueSkill, Glicko and Item-Response Theory, correctly models the temporal aspect, which prevents having both good initial estimates and comparability between estimates separated in time and space. TrueSkill Through Time corrects those biases by modeling the entire history of activities using a single Bayesian network. The use of an efficient algorithm, that requires only a few linear iterations over the data, allows scaling to millions of observations in few seconds. A full scientific documentation is discribed at [TrueSkill Through Time: the Julia, Python and R packages](https://github.com/glandfried/TrueSkillThroughTime/). ### Parameters In the following code we define the variables that we will use later, assigning the default values of the packages. `mu = 0.0; sigma = 6.0; beta = 1.0; gamma = 0.03; p_draw = 0.0` ### Players With these default values we create four identical players. `a1 = Player(Gaussian(mu, sigma), beta, gamma); a2 = Player(); a3 = Player(); a4 = Player() ` The `Gaussian` class is used to model the standard operations of Gaussian distributions including multiplication, summation, division, and substraction. ### Games In the next step we create a game with two teams of two players. ``` team_a = [ a1, a2 ] team_b = [ a3, a4 ] teams = [team_a, team_b] g = Game(teams) ``` where the result of the game is implicitly defined by the order of the teams in the list: the teams appearing first in the list (lower index) beat those appearing later (higher index). During the initialization, the class `Game` computes the prior prediction of the observed result and the approximate likelihood of each player. ``` lhs = g.likelihoods[0][0] ev = g.evidence ev = round(ev, 3) print(ev) > 0.5 ``` In this case, the evidence is 0.5 indicating that both teams had the same probability of winning given the prior estimates. Posteriors can be found by manually multiplying the likelihoods and priors, or we can call the method `posteriors()` of class `Game` to compute them. ``` pos = g.posteriors() print(pos[0][0]) > Gaussian(mu=2.361, sigma=5.516) print(lhs[0][0] * a1.prior) > Gaussian(mu=2.361, sigma=5.516) ``` Due to the winning result, the estimate of the first player of the first now has a larger mean and a smaller uncertainty. We now analyze a more complex. The players are organized in three teams of different size: two teams with only one player, and the other with two players. The result has a single winning team and a tie between the other two losing teams. ``` ta = [a1] tb = [a2, a3] tc = [a4] teams = [ta, tb, tc] result = [1, 0, 0] g = Game(teams, result, p_draw=0.25) ``` the team with the highest score is the winner and the teams with the same score are tied. The evidence and the posteriors can be queried in the same way as before. ### Sequence of Events The class `History` is used to compute the posteriors and evidence of a sequence of events. In the first example, we instantiate the class with three players `"a", "b", "c"` and three games in which all agents win one game and lose the other. ``` c1 = [["a"],["b"]] c2 = [["b"],["c"]] c3 = [["c"],["a"]] composition = [c1, c2, c3] h = History(composition, gamma=0.0) ``` where the variables `c1`, `c2`, and `c3` model the composition of each game using the names of the agents (i.e. their identifiers), the variable `composition` is a list containing the three events, and the zero value of the parameter `gamma` specifies that skills does not change over time. After initialization, the class `History` immediately instantiates a new player for each name and activates the computation of the TrueSkill estimates, using the posteriors of each event as a prior for the next one. ``` lc = h.learning_curves() print(lc["a"]) > [(1, Gaussian(mu=3.339, sigma=4.985)), (3, Gaussian(mu=-2.688, sigma=3.779))] print(lc["b"]) > [(1, Gaussian(mu=-3.339, sigma=4.985)), (2, Gaussian(mu=0.059, sigma=4.218))] ``` The learning curves of players `"a"` and `"b"` contain one tuple per game played (not including the initial prior): each tuple has the time of the estimate as the first component, and the estimate itself as the second one. Although in this example no player is stronger than the others, the TrueSkill estimates present strong variations between players. TrueSkill Through Time solves this problem by allowing the information to propagate throughout the system by calling the method `convergence()`. ``` h.convergence() lc = h.learning_curves() print(lc["a"]) > [(1, Gaussian(mu=0.0, sigma=2.395)), (3, Gaussian(mu=-0.0, sigma=2.395))] print(lc["b"]) > [(1, Gaussian(mu=-0.0, sigma=2.395)), (3, Gaussian(mu=0.0, sigma=2.395))] ``` TrueSkill Through Time not only returns correct estimates (same for all players), they also have less uncertainty. ### Skill evolution This example will exhibit that TrueSkill Through Time can correctly follows the skill evolution of a new player taht joins a large community of already known players. In the following code, we generate the target player's learning curve and 1000 random opponents. ``` import math; from numpy.random import normal, seed; seed(99); N = 1000 def skill(experience, middle, maximum, slope): return maximum/(1+math.exp(slope*(-experience+middle))) target = [skill(i, 500, 2, 0.0075) for i in range(N)] opponents = normal(target,scale=0.5) ``` The list `target` has the agent's skills at each moment: the values start at zero and grow smoothly until the target player's skill reaches two. The list `opponents` includes the randomly generated opponents' skills following a Gaussian distribution centered on each of the target's skills and a standard deviation of 0.5. ``` composition = [[["a"], [str(i)]] for i in range(N)] results = [[1,0] if normal(target[i]) > normal(opponents[i]) else [0,1] for i in range(N)] times = [i for i in range(N)] priors = dict([(str(i), Player(Gaussian(opponents[i], 0.2))) for i in range(N)]) h = History(composition, results, times, priors, gamma=0.015) h.convergence() mu = [tp[1].mu for tp in h.learning_curves()["a"]] ``` In this code we define four variables to instantiate the class `History`: the `composition` contains 1000 games between the target player and different opponents; the `results` are obtained randomly, sampling the performance of the players; the `time` is a list of integer ranging from 0 to 999 representing the time of each game; and `priors` is a dictionary used to customize player attributes (we assign low uncertainty to the opponents' priors pretending that we know their skills beforehand). The Figure shows the evolution of the true (solid line) and estimated (dotted line) learning curves of the target player. ![synthetic](https://github.com/glandfried/TrueSkillThroughTime.py/blob/master/static/logistic0.png) The estimated learning curves remain close to the actual skill during the whole evolution. ### ATP History In this last example, we analyze the complete history of the Association of Tennis Professionals (ATP) registered matches. The database has 447000 games starting at year 1915 until 2020 with more than 19000 participating players and is publicly available. The file includes both single and double matches: if the column `double` has the letter `t`, the game is a double match. The file also contains players' identifiers and names: for example column `w2_id` is the identifier of the second player of the winning team and `l1_name` is the name of the first player of the losing team. ``` import pandas as pd; from datetime import datetime df = pd.read_csv('atp.csv') columns = zip(df.w1_id, df.w2_id, df.l1_id, df.l2_id, df.double) composition = [[[w1,w2],[l1,l2]] if d=='t' else [[w1],[l1]] for w1, w2, l1, l2, d in columns] days = [ datetime.strptime(t, "%Y-%m-%d").timestamp()/(60*60*24) for t in df.time_start] h = History(composition = composition, times = days, sigma = 1.6, gamma = 0.036) h.convergence(epsilon=0.01

评论收藏

内容反馈

版权申诉