Extraction of interesting financial information from heterogeneous XML-based data

Juryon Paik, Young Ik Eom, Ung Mo Kim

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

XML is going to be the main language for exchanging financial information between businesses over the Internet. As more and more banks and financial institutions move to electronic information exchange and reporting, the financial world is in a flood of information. With the sheer amount of financial information stored, presented and exchanged using XML-based standards, the ability to extract interesting knowledge from the data sources to better understand customer buying/selling behaviors and upward/downward trends in the stock market becomes increasingly important and desirable. Hence, there have been growing demands for efficient methods of discovering valuable information from a large collection of XML-based data. One of the most popular approaches to find the useful information is to mine frequently occurring tree patterns. In this paper, we propose a novel algorithm, FIXiT,for efficiently extracting maximal frequent subtrees from a set of XML-based documents. The main contributions of our algorithm are that: (1) it classifies the available financial XML standards such as FIXML, FpML, XBRL, and so forth with respect to their specifications, and (2) there is no need to perform tree join operations during the phase of generating maximal frequent subtrees.

Original languageEnglish
Title of host publicationComputational Science - ICCS 2006
Subtitle of host publication6th International Conference, Proceedings
PublisherSpringer Verlag
Pages356-363
Number of pages8
ISBN (Print)3540343857, 9783540343851
DOIs
StatePublished - 2006
EventICCS 2006: 6th International Conference on Computational Science - Reading, United Kingdom
Duration: 28 May 200631 May 2006

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume3994 LNCS - IV
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceICCS 2006: 6th International Conference on Computational Science
Country/TerritoryUnited Kingdom
CityReading
Period28/05/0631/05/06

Fingerprint

Dive into the research topics of 'Extraction of interesting financial information from heterogeneous XML-based data'. Together they form a unique fingerprint.

Cite this