Adventures of a multidimensional freak

This is Juan Julián Merelo Guervós's English-language blog. He teaches computer science at the University of Granada, in southern Spain. Come back here to read about politics and technology, with a new twist.

    This work is licensed under a Creative Commons License.


    Automatic detection of trends in time-stamped sequences: an evolutionary approach available online

    Our latest paper on modelling time streams, such as the stream of comments to this site, Blogalia, which I coauthored with Dr. Lourdes Araújo, is available online. From the abstract:
    This paper presents an evolutionary algorithm for modeling the arrival dates in time-stamped data sequences such as newscasts, e-mails, IRC conversations, scientific journal articles or weblog postings. These models are applied to the detection of buzz (i.e. terms that occur with a higher-than-normal frequency) in them, which has attracted a lot of interest in the online world with the increasing number of periodic content producers. That is why in this paper we have used this kind of online sequences to test our system, though it is also valid for other types of event sequences. The algorithm assigns frequencies (number of events per time unit) to time intervals so that it produces an optimal fit to the data. The optimization procedure is a trade off between accurately fitting the data and avoiding too many frequency changes, thus overcoming the noise inherent in these sequences. This process has been traditionally performed using dynamic programming algorithms, which are limited by memory and efficiency requirements. This limitation can be a problem when dealing with long sequences, and suggests the application of alternative search methods with some degree of uncertainty to achieve tractability, such as the evolutionary algorithm proposed in this paper. This algorithm is able to reach the same solution quality as those classical dynamic programming algorithms, but in a shorter time. We also test different cost functions and propose a new one that yields better fits than the one originally proposed by Kleinberg on real-world data. Finally, several distributions of states for the finite state automata are tested, with the result that an uniform distribution produces much better fits than the geometric distribution also proposed by Kleinberg. We also present a variant of the evolutionary algorithm, which achieves a fast fit of a sequence extended with new data, by taking advantage of the fit obtained for the original subsequence.
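To make the trade-off in the abstract concrete, here is a minimal sketch of the classical dynamic programming baseline it mentions, not of the evolutionary algorithm in the paper. It assumes a Kleinberg-style setup: a handful of states with geometrically spaced event rates, an exponential model for the gaps between events, and a penalty for moving to a higher-rate state. The function name `fit_rates` and the parameters `n_states`, `scale` and `gamma` are illustrative choices, not taken from the paper.

```python
import math

def fit_rates(gaps, n_states=3, scale=2.0, gamma=1.0):
    """Assign one rate state to each inter-arrival gap (Viterbi-style search).

    Assumed model: state i emits gaps from an exponential distribution with
    rate base_rate * scale**i, and moving up k states costs k * gamma * ln(n).
    """
    n = len(gaps)
    base_rate = n / sum(gaps)  # average number of events per time unit
    rates = [base_rate * scale ** i for i in range(n_states)]

    def fit_cost(state, gap):
        # Negative log-likelihood of the gap under the state's exponential rate.
        return rates[state] * gap - math.log(rates[state])

    def change_cost(i, j):
        # Only upward changes are penalized; dropping back down is free.
        return gamma * (j - i) * math.log(n) if j > i else 0.0

    # The sequence is assumed to start from the lowest-rate state.
    cost = [change_cost(0, j) + fit_cost(j, gaps[0]) for j in range(n_states)]
    back = []  # back[t][j] = best predecessor of state j at gap t + 1
    for gap in gaps[1:]:
        new_cost, pointers = [], []
        for j in range(n_states):
            best_i = min(range(n_states),
                         key=lambda i: cost[i] + change_cost(i, j))
            new_cost.append(cost[best_i] + change_cost(best_i, j)
                            + fit_cost(j, gap))
            pointers.append(best_i)
        cost = new_cost
        back.append(pointers)

    # Walk the back-pointers from the cheapest final state.
    state = min(range(n_states), key=lambda j: cost[j])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path, rates

# Toy sequence: slow arrivals, a burst of fast arrivals, then slow again.
gaps = [10.0] * 10 + [1.0] * 10 + [10.0] * 10
path, rates = fit_rates(gaps)
print(path)
```

Raising `gamma` makes state changes more expensive, so the fit ignores short-lived fluctuations; lowering it lets the fit follow the data more closely. The table of costs grows with both the sequence length and the number of states, which is the kind of memory and efficiency limit the abstract gives as the motivation for an evolutionary search.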

    If SpringerLink is not available at your institution, please email me for a copy. It has been a really long time since the initial version, with three revisions (and even more if you count the versions that were sent to, and rejected by, other journals), but it has finally been published. Now, on to the next one...


    2009-03-18 09:54 | 1 Comment(s) | Filed in Research





    © 2002 - 2008 jmerelo
    Powered by Blogalia