The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Evaluation of researchers is a big issue in institutional research. We propose a method for quantitatively evaluating the stage of young, middle and senior researchers focusing on the role of the last author in co-authored papers. We trace the two time series of the number of published papers and the ratio of the last authored papers among them of each researcher. We conducted experiments on 84 researchers...
When texts are mined for meaningful information, one important aspect is to construct a coding rule that categorizes key terms into several conceptual groups. Usually such a rule is human-made and tends to be subjective. The present study attempts to build coding rules automatically from the ISO~26000 document and compares the results with those obtained by creating the coding rules manually.
A survey of related work is an important task for every researcher, and databases of scientific articles are indispensable for this task. This paper proposes a new visualization method for search results and demonstrates a system that implements this method. Given a query, the system returns a list of articles and displays a time series of citation counts (CCs) for each article. The novelty of the...
The literature survey by scientific bibliographic data base is indispensable in the research activities. We can find related articles with appropriate keywords. However, the threads of related research are not easy to grasp from the search result. It is necessary to repeat a search, judge a citation relation and figure out the thread. The present paper proposes the index "accumulated citation...
Literature survey of scientific articles depends on the relevancy and the quality of the obtained list. Relevancy might be controlled by an appropriate search query and the relevancy ranking of the search result. Citation count (CC) is widely used and useful as an easy measure to evaluate the quality of articles. However, articles with high citation count might cover a wide area, while they might...
Individual opinions and experiences are published in Web as CGM (consumer generated media). A tourism blog which a tourist wrote his experience and impression in a certain area is very helpful information for other tourists. However, a user cannot obtain such precious information without knowing the relation of blog articles and concrete place-names. We paid our attention to the hierarchical structure...
Blog articles by tourists contain interesting and personal experiences of where and how they have gone, what they have done and what they thought. Such individual experiences are helpful in many cases compared to the general and official information about the tourist resort by tourist agents. However, it is not easy to choose related articles and to extract still more nearly required information from...
Corporate analysis is needed for various purposes such as finding a good business partner or a good employment, as well as choosing a good investment. Conventionally, it has been based mainly on financial figures. Recent advances in natural language processing technology, however, has activated studies on analysis of non-financial, textual data. This paper tries to predict the growth rate of the operating...
Blog articles by tourists contain interesting and personal experiences of where and how they have gone, what they have done and what they thought. Such individual experiences are helpful in many cases compared to the general and official information about the tourist resort by tourist agents. However, it is not easy to choose related articles and to extract still more nearly required information from...
We herein investigate finding unusual patterns from a given string as a text. In the present paper, the pattern is expressed as a sub string of the string. The natural assumption with respect to the frequency of a pattern is that the shorter the length of the pattern, the larger the frequency of the pattern. We define a pattern to be pure if the frequencies of all of the sub strings of the pattern...
When starting new research or summarizing the results of research, it is necessary to review related work in the same research field. The research review requires several point of views such as ``problem'', ``method'', ``result''. Simple search by keywords is not effective to specify and narrow-down the scope of search for these meta purpose. In this paper, we focus on sentences to improve the efficiency...
Various viewpoints are required to make a survey and a trend analysis on related research. In order to find important problems, especially in unfamiliar field, simple search and clustering is not enough. We have to read most of the articles carefully. The work requires a lot of time and effort. This paper analyzes the sentences that describe the problem using SVM. It turned out the negative words...
By development of the Internet in recent years, tourism portal sites and blog articles about tourism increased on WWW. Acquisition of various tourism information became easy. When gathering and classifying the information automatically from blog articles, it is not easy to decide automatically place names used as the key. In this paper, we propose a method of extracting place names from blog articles...
This paper targets the students, who are just beginning to engage in research. With the data-mining technologies, using the data of KAKEN (Grant-in-Aid for Scientific Research of Japan), according to students' learning styles and learners' knowledge levels, we propose to create a "Learning by Searching" search engine to provide suitable knowledge and help students to master research trends...
We are developing a concept dictionary for Petit Trips (Short Time Trips). Based on the concept dictionary, we propose a smart phone-based system to support the Petit Trips. This system provides travelers a Classification Network of tourist spots according to their locations. While travelers select their preferred tourist spots, the system will recommend the suitable tourist spots which have similar...
A lot of information concerning the status of companies are available on the Web. However, a simple search of documents does not explain the meaning or the cause the status. Semantical interpretation and hypotheses generation are necessary for further analysis. This paper proposes a method to analyse the cause and the situation of bankruptcy with respect to particular condition that a user can specify...
In recent years, with the development of the Internet, and the rapidly increasing number of tourism portal sites and blogs, we can obtain a variety of tourist information on the Internet. If we have a specific need, we can obtain the required information through checking the retrieved results one by one. However, in order to see the whole trend of the results, it is necessary to analyze and visualize...
A wrapper is a program that selectively extracts a necessary part (component) from Web pages. Automatic or semi-automatic wrapper construction is crucial to achieve a fine grained search engine for Web pages. However, this is not an easy task to achieve. This paper proposes a component-based search engine in which the content components gain a high score in the search results. Thus, the required segments...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.