1.
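From the perspective of improving the overall operating efficiency of a Web usage mining system, this paper optimizes the design of the system's data scheme. By refining the collection work, simplifying the set of information elements to be collected, extending the identification function of information elements, classifying information for submission and storage on the basis of information abstraction, and performing distributed data preprocessing, the system completes data collection with high quality while markedly improving storage efficiency, performance balance, and parsing and dump efficiency.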
2.
3.
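With the rapid expansion of Web information, efficient collection tools are needed to gather information resources. An intelligent Web information collection system can automatically collect and classify Web information and provide the necessary support for information searching and browsing.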
4.
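Using the ISI Web of Knowledge platform and the analysis tools TDA, CiteSpace, and Pajek, this study analyzes and compares, from multiple angles, the microbial fuel cell papers indexed by SCI from 2000 to 2011. Through the distribution by country or region, major research institutions, journal distribution, and classic literature, it reveals from a bibliometric perspective the recent research status and development trends of international microbial fuel cell research.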
5.
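For the purposes of informetric research, this paper makes a preliminary study of the quantitative measurement of the scale and scope of Web information resources and the quantitative analysis of changes in those resources. It also reviews progress in theoretical research on the bibliometric distribution models followed by Web documents and their hyperlinks.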
6.
7.
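Using the information visualization software CiteSpace III, this study analyzes bibliographic records on e-commerce credit research from 1999 to 2013 downloaded from Web of Science. Visual analyses of publication counts, authors, source institutions, and countries reveal the core authors, institutions, and countries in this field.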
8.
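Because the Internet and the Web are open, changing, unstructured, dynamic, and unordered organizations of massive information resources, the collection and quality control of network information data has become a hot topic in webometrics. This paper studies quality control in network data collection fairly comprehensively, covering the uniform determination of the Web retrieval time period, sampling design for Web pages and Web sites, methods for avoiding repeated collection of pages and for collecting important pages first, and techniques for topic-oriented collection of specific information.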
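The abstract above names two collection-quality techniques, avoiding repeated collection of pages and collecting important pages first, without giving details. A minimal sketch of how the two can fit together in one crawl frontier follows. The `Frontier` class, its URL normalization rule (drop the fragment and a trailing slash), and the numeric `importance` score are all hypothetical illustrations, not the paper's method.

```python
import heapq
from urllib.parse import urldefrag


class Frontier:
    """Hypothetical crawl frontier: skips duplicate URLs, pops important ones first."""

    def __init__(self):
        self._seen = set()   # normalized URLs already queued or visited
        self._heap = []      # entries are (-importance, insertion order, url)
        self._count = 0      # tie-breaker so equal scores pop in insertion order

    @staticmethod
    def normalize(url):
        # Assumed normalization: drop the #fragment and any trailing slash.
        return urldefrag(url).url.rstrip("/")

    def add(self, url, importance):
        """Queue a URL; return False if an equivalent URL was already added."""
        key = self.normalize(url)
        if key in self._seen:
            return False
        self._seen.add(key)
        # heapq is a min-heap, so negate the score to pop the largest first.
        heapq.heappush(self._heap, (-importance, self._count, key))
        self._count += 1
        return True

    def pop(self):
        """Return the most important queued URL, or None when the queue is empty."""
        if not self._heap:
            return None
        return heapq.heappop(self._heap)[2]
```

The importance score could come from any signal, such as an in-link count; that choice is left open here because the abstract does not specify one.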
9.
10.
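Metering monitoring mainly performs real-time monitoring, analysis, and management of electric energy metering data. It extracts on-site electricity usage information from collection devices installed at the customer's premises, compares and analyzes these data comprehensively, and applies certain judgment conditions to conclude whether the customer's electricity purchase and usage are normal.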
11.
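Several Issues in Research on Network Information Aging   Total citations: 13, self-citations: 1, citations by others: 13
The main task of research on network information aging is, guided by the research objectives, to find measurement indicators for the aging of network information and to describe and statistically analyze it with mathematical, statistical, and other quantitative methods, so as to reveal its quantitative characteristics and inherent regularities, build corresponding mathematical models, and propose a theoretical framework of explanation. The research therefore covers: (1) measurement of network information aging; (2) the laws of network information aging; and (3) applications of those laws.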
12.
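Evaluation of Web Page Quality Analysis Methods in Web Information Retrieval Systems   Total citations: 1, self-citations: 0, citations by others: 1
Improving retrieval precision for high-quality Web pages would greatly increase user satisfaction with Web information retrieval systems. The paper first proposes a "usefulness" indicator for information retrieval and, on that basis, discusses a Web information retrieval model based on page quality analysis. It then proposes direct and indirect measurement indicators of page quality. Finally, it describes in detail the research and methods related to the various page quality indicators and evaluates each of them.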
13.
14.
Comparing rankings of search results on the Web   Total citations: 1, self-citations: 0, citations by others: 1
The Web has become an information source for professional data gathering. Because of the vast amounts of information on almost all topics, one cannot systematically go over the whole set of results, and therefore must rely on the ordering of the results by the search engine. It is well known that search engines on the Web have low overlap in terms of coverage. In this study we measure how similar the rankings of search engines are on the overlapping results. We compare rankings of results for identical queries retrieved from several search engines. The method is based only on the set of URLs that appear in the answer sets of the engines being compared. For comparing the similarity of rankings of two search engines, the Spearman correlation coefficient is computed. When comparing more than two sets, Kendall’s W is used. These are well-known measures and the statistical significance of the results can be computed. The methods are demonstrated on a set of 15 queries that were submitted to four large Web search engines. The findings indicate that the large public search engines on the Web employ considerably different ranking algorithms.
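The comparison described in this abstract can be sketched in a few lines. The sketch below assumes each engine's answer set is a list of distinct URLs in rank order, re-ranks only the URLs that every engine returned, and uses the standard no-ties formulas for Spearman's rho and Kendall's W. The function names are illustrative and are not taken from the paper.

```python
def common_ranks(result_lists):
    """Return one rank list per engine, restricted to URLs present in every list.

    Ranks run 1..n within the overlap; positions follow the first engine's order.
    Assumes each result list contains no duplicate URLs.
    """
    common = set(result_lists[0]).intersection(*result_lists[1:])
    items = [u for u in result_lists[0] if u in common]
    ranks = []
    for results in result_lists:
        kept = [u for u in results if u in common]
        position = {u: i + 1 for i, u in enumerate(kept)}
        ranks.append([position[u] for u in items])
    return ranks


def spearman(ranks_a, ranks_b):
    """Spearman rank correlation for two tie-free rankings of the same n items."""
    n = len(ranks_a)
    if n < 2:
        raise ValueError("need at least two overlapping results")
    d2 = sum((a - b) ** 2 for a, b in zip(ranks_a, ranks_b))
    return 1 - 6 * d2 / (n * (n * n - 1))


def kendalls_w(rank_lists):
    """Kendall's coefficient of concordance W for m tie-free rankings of n items."""
    m, n = len(rank_lists), len(rank_lists[0])
    if n < 2:
        raise ValueError("need at least two overlapping results")
    totals = [sum(r[i] for r in rank_lists) for i in range(n)]
    mean = m * (n + 1) / 2
    s = sum((t - mean) ** 2 for t in totals)
    return 12 * s / (m * m * (n ** 3 - n))
```

For two engines, `spearman(*common_ranks([list_a, list_b]))` gives the pairwise similarity; for three or more, pass the output of `common_ranks` directly to `kendalls_w`. Significance testing, which the abstract mentions, is not covered by this sketch.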
15.
This paper presents not only mycommunityinfo.ca (MCI) as an innovative World Wide Web (WWW)-based community information (CI) site, but also how its unique approach to facilitating online CI searching on the Web reveals through empirical data how people use such information and communication technologies (ICTs) to address their everyday information needs. The geographic focus for this study is on three communities in Southwestern Ontario. MCI unobtrusively collects query data that are logged daily from its own Web site, the Web sites of three municipal governments, and one municipal agency from this region. One year’s worth of these data was supplied to determine the types of CI that are sought through Web searching. A content analysis of a large purposive sample of all of MCI’s query data reveals more specific and diverse conceptual CI needs between and within communities than those reported in other studies employing different data collection methods. As a result, using a centralized approach to online CI access via the WWW by other CI providers such as the 211 network may be a disservice to its users. Additionally, the findings demonstrate how a thorough analysis of such data may improve the informational content and overall design of municipal government Web sites. The analysis of these data also has the potential of improving current CI taxonomies.
16.
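Taking information quality and system quality as the main measurement dimensions, the paper builds a model for evaluating website users' information satisfaction based on the expectation-comparison paradigm. It then collects data and validates the model with PLS structural equation modeling, reaching conclusions about the factors that influence website users' information satisfaction and the strength of their influence, which provide guidance for evaluating such satisfaction.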
17.
When information wanted to be free: Discursive bifurcation of information and the origins of Web 2.0
Eran Fisher, The Information Society, 2018, 34(1): 40-48
In the 1990s the aphorism “information wants to be free” reigned supreme, limiting our thinking in consequential ways. In actuality this aphorism was a fragment of a much more nuanced statement by Stewart Brand, who also talked about “information wants to be expensive.” It seemed for quite a while that there was no resolution to the contradiction: information as both free and expensive. Eventually Web 2.0 resolved this contradiction by providing an architecture where information could be both free and expensive. Web 2.0 was not a product of technological advances: social media, wikis, big data platforms, and so forth. It was born out of the understanding that free information on media platforms could yield profitable data on users. This article lays bare the discursive moves through which this understanding came about.
18.
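A Preliminary Study of Chinese Synonym Recognition Based on Information Retrieval   Total citations: 3, self-citations: 0, citations by others: 3
With the rapid development of computers, natural language is ever more widely used in information retrieval, and synonym control has become a hot research topic in information science. This paper proposes a method for recognizing synonyms that relies on statistics obtained from Web search queries: the Dice measure is used to compute the relatedness of two words, and two words are regarded as synonyms when their relatedness falls within a given threshold. Analysis of the test results verifies the feasibility of the method, and the paper discusses its strengths, weaknesses, and applications.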
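The Dice measure mentioned in this abstract can be computed from three search hit counts: the hits for each word alone and the hits for both words together. The sketch below assumes the caller supplies those counts. The function names, the default threshold of 0.5, and the reading of "within a given threshold" as "at or above the threshold" are all assumptions made for illustration, not details from the paper.

```python
def dice(hits_a, hits_b, hits_both):
    """Dice coefficient: 2 * |A and B| / (|A| + |B|), computed from hit counts."""
    total = hits_a + hits_b
    if total == 0:
        return 0.0  # neither word was found, so treat the pair as unrelated
    return 2 * hits_both / total


def is_synonym_candidate(hits_a, hits_b, hits_both, threshold=0.5):
    """Flag a word pair whose Dice relatedness reaches the threshold.

    The default threshold is a placeholder; the abstract does not state a value.
    """
    return dice(hits_a, hits_b, hits_both) >= threshold
```

In practice the hit counts would come from querying a search engine for each word and for the pair, and the threshold would be tuned on labeled word pairs.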