The Machine Learning Forum

CB Login

CB Workflows

CB Online

None
Drifting Games, Boosting, and Online learning. PDF Print E-mail
Written by Yoav Freund   
Saturday, 27 June 2009 13:15

Drifting Games, Boosting and Online Learning


Drifting games are mathematical games between a shepherd and a flock of sheep. The goal of the shepard is to get the sheep into a prescribed region - the stable. This game has been used by Freund to model problems in machine learning, specifically, to model boosting and online learning.

History

The first papers to use the ideas of drifting games were
  1. Boosting a weak learning algorithm by majority. Yoav Freund / Information and Computation / 1995
  2. On-line prediction and conversion strategies. Nicolò Cesa-Bianchi, Yoav Freund, David P. Helmbold, and Manfred K. Warmuth. / Machine Learning / 1996. (This paper presented the online algorithm called "Binomail weights" (BW) .)
These two papers use a special case of a drifting game where the location of the drifting elements is defined by an integer number. It turns out that the derived policies are min-max optimal when the number of sheep increases to infinity. Another more transparent way to consider this limit is to assume that the set sheep (examples in the boosting context, experts in the online learning context) is "continuous" or "arbitrarily divisible.

 

 

One important feature of the boost-by-majority algorithm is that it "gives up" on examples on which to many of the weak classifiers are wrong. This is a direct outcome of the fact that the number of boosting iterations is set in advance, resulting in a game with a finite horizon. This property does not hold for the better known boosting algorithms, Adaboost, Logitboost and their variants.


Schapire was the first to formalize the common ideas shared by Boost-by-Majority and the Binomial weights algorithms. He did so in this paper:

  • Drifting games. Robert E. Schapire. Machine Learning, 43(3):265-291, 2001.
Around the same time, Freund was working on an making the Boost-by-Majority algorithm adaptive. The goal of this work was to remove the requirement that the weak learner generates weak classifiers with a fixed a-priori known advantage over random guessing. The ability of Adaboost to adapt to weak classifiers with variable advantage was one of the important reasons that Adaboost has had great success in practice. The method used to make Boost by majority adaptive was to let the number of boosting iterations increase to infinity and operate in continuous time. The resulting algorithm was named "Brownboost" because it's analysis is based on Brownian motion.
In a followup paper, Opper and Freund combined the generality of drifting games and the continuous-time method used in Brownboost:

The brownboost algorithm did not prove itself in practice. The problem is that when shooting for an error that can be achieved on the training set, the weights assigned to the weak rules tends to increase to infinity. To avoid this from happening, the norm of the rule-weights vector needs to be controlled. This was achieved in Robustboost, by using a type of brownian motion that includes a drift term that pushes the weights towards zero. The Robustboost algorithm does seem to work in practice and is available as part of the JBoost software package. Robustboost is described in this paper:

Drifting games, Online learning and Tracking

Following up on the BW algorithm and applying the drifting games methodology to a more sophisticated loss function resulted in a new online learning algorithm, called Normalhedge. This algorithm is unique in that unlike the exponential weights online learning algorithms it does not have a learning rate parameter that needs to be tuned. It is therefor a truly universal online learning algoirthm for bounded losses. It achieves very similar bounds to the one achieved by exponential weights algorithms with optimal tuning.

Analyzing NormalHedge in continuous time yield much more transparent proofs and tighter bounds. However, it does require familiarity with stochastic process and the Ito integral.

NormalHedge can be applied to the tracking problem. This results in a new type of particle filter algorithm. This new algorithm has the advantage that it is based on much weaker assumptions than the standard particle filter and therefor gives better tracking when the underlying process that is generating the data is not known.
Comments (2)
  • daxue001  - dfdf
    Official Christian Louboutin Slingbacks
    Pro Bowl Christian Louboutin Slingbacks
    Super Bowl Christian Louboutin Slingbacks
    Christian Louboutin Boots
    Wholesale Christian Louboutin Boots
    China Christian Louboutin Boots
    Discount Christian Louboutin Boots
    Cheap Christian Louboutin Boots
    Authentic Christian Louboutin Boots
    Replica Christian Louboutin Boots
    Youth Christian Louboutin Boots
    Official Christian Louboutin Boots
    Pro Bowl Christian Louboutin Boots
    Super Bowl Christian Louboutin Boots
  • skinnymaggie  - silver tiffany
    All trades, arts, and handiworks have gained by division of labour, namely, when, instead of one man doing everything, each confines himself to a certain kind of work distinct from others in the treatment it requires,Silver Tiffany so as to be able to perform it with greater facility and in the greatest perfection. Tiffany JewelryWhere the different kinds of work are not distinguished and divided, where everyone is a jack-of-all-trades, Tiffany Braceletsthere manufactures remain still in the greatest barbarism. Tiffany NecklacesIt might deserve to be considered whether pure philosophy in all its parts Tiffany Earringsdoes not require a man specially devoted to it, and whether it would not be better for the whole business of science if those who, to please the tastes of the public, are wont to blend the rational and empirical elements together, mixed in all sorts of proportions unknown to themselves, and who call themselves independent thinkers, giving the name of minute philosophers...
Only registered users can write comments!

!joomlacomment 4.0 Copyright (C) 2009 Compojoom.com . All rights reserved."

Last Updated on Monday, 13 July 2009 07:19