Skip to main navigation Skip to search Skip to main content

Internet articles classification by industry types based on TF-IDF

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In order to understand a specific industry field, people usually look at the financial statements of the companies relevant to the industry field. Financial statements have diverse and numerical information but have past financial states of companies because those are usually quarterly reported. So, needs to timely obtain the current states of an industry field is increasing. Proposed method is focusing on internet articles because they are easy to obtain and updated with new information every day. As a preliminary study of extracting information on industries from internet articles, this paper proposes a method to classify internet articles by industry types. The proposed method in this paper computes importance values of nouns in internet articles based on TF-IDF. Using calculated importance values, proposed method classifies articles by industry types. Through experiments, it is proven that proposed method can achieve high accuracy in industry article classification.

Original languageEnglish
Title of host publicationAdvances in Computer Science and Ubiquitous Computing - CSA-CUTE 17
EditorsGangman Yi, Yunsick Sung, James J. Park, Vincenzo Loia
PublisherSpringer Verlag
Pages1121-1125
Number of pages5
ISBN (Print)9789811076046
DOIs
StatePublished - 2018
EventInternational Conference on Computer Science and its Applications, CSA 2017 - Taichung, Taiwan, Province of China
Duration: 18 Dec 201720 Dec 2017

Publication series

NameLecture Notes in Electrical Engineering
Volume474
ISSN (Print)1876-1100
ISSN (Electronic)1876-1119

Conference

ConferenceInternational Conference on Computer Science and its Applications, CSA 2017
Country/TerritoryTaiwan, Province of China
CityTaichung
Period18/12/1720/12/17

Keywords

  • Classification
  • Industry
  • Internet article
  • TF-IDF

Fingerprint

Dive into the research topics of 'Internet articles classification by industry types based on TF-IDF'. Together they form a unique fingerprint.

Cite this