Technological foundations of the current Blogosphere paper accepted at WIMS’12

6:30 am in Blog, News, Publications by Vangelis Banos

A paper titled “Technological foundations of the current Blogosphere” has been accepted at the International Conference on Web Intelligence, Mining and Semantics (WIMS’12), to be held on 13-15 June 2012 in Craiova, Romania.

Authors: Vangelis Banos, Karen Stepanyan, Yannis Manolopoulos, Mike Joy and Alexandra Cristea

Abstract: In this paper, we review the technological foundations of the current Blogosphere. The review is primarily based on a large-scale evaluation of active blogs. The extensive list of examined technologies enables commenting on a range of widely adopted standards and potential trends in the Blogosphere. The evaluation has been conducted in the following stages:

  1. Retrieving and parsing a large set of blogs
  2. Identifying and quantifying the use of technologies such as web standards, adopted services, file formats and platforms.
  3. Analysing collected data and reporting the results
  4. Comparing the results with existing findings from the generic Web to identify similarities and differences in the Blogosphere.

The presented work was performed as part of BlogForever (ICT No. 269963), an EC-funded research project aiming to aggregate, preserve, manage and disseminate blogs. The results of this study are relevant within the context of weblog preservation and weblog data extraction.
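
As a rough illustration of stage 2 of the evaluation described in the abstract (identifying and quantifying the technologies that blogs use), the sketch below counts platforms and feed formats across a set of already retrieved pages. It is not the method used in the paper: the class and function names are hypothetical, and it looks only at the `generator` meta tag and RSS/Atom feed links, using Python's standard library.

```python
from collections import Counter
from html.parser import HTMLParser


class TechnologyScanner(HTMLParser):
    """Collects a few technology hints from one HTML page (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.generator = None    # e.g. "WordPress 3.3"
        self.feed_types = set()  # e.g. {"application/rss+xml"}

    def handle_starttag(self, tag, attrs):
        # Attribute values can be None for valueless attributes.
        attrs = {key: (value or "") for key, value in attrs}
        if tag == "meta" and attrs.get("name", "").lower() == "generator":
            self.generator = attrs.get("content", "").strip() or None
        elif tag == "link" and "alternate" in attrs.get("rel", "").lower().split():
            feed_type = attrs.get("type", "").lower()
            if feed_type in ("application/rss+xml", "application/atom+xml"):
                self.feed_types.add(feed_type)


def scan_page(html):
    """Return (generator string or None, set of feed MIME types) for one page."""
    scanner = TechnologyScanner()
    scanner.feed(html)
    return scanner.generator, scanner.feed_types


def summarise(pages):
    """Count platforms and feed formats over an iterable of HTML strings."""
    platforms, feeds = Counter(), Counter()
    for html in pages:
        generator, feed_types = scan_page(html)
        # Keep only the platform name, dropping any version suffix.
        platforms[generator.split()[0] if generator else "unknown"] += 1
        feeds.update(feed_types)
    return platforms, feeds
```

Calling `summarise` on a list of page sources returns two counters, one keyed by platform name and one by feed MIME type. A real study would need many more signals than these two.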

Test the blog spider prototype

10:52 pm in Blog, News by Morten Rynning

Finally, the first software delivery of the BlogForever project is available: the prototype of a blogosphere spider.

The spider enables crawling and monitoring lists of identified blogs as well as new, unknown blogs. Any new blog posts or comments from each blog will be added to the feed through the spider.

The spider can be downloaded and run on a single server, and managed through a web portal interface, as seen in the figure below.

Figure 1 - Spider portal: details linked from any of the indexed sources can display the crawled XML and link to the actual HTML.

Although this is only the first prototype of the research project, we have tested it by crawling 36,000 distinct blogs and extracting approximately 1 GB of blog data.

The prototype spider can be downloaded and run from: http://bf2.csd.auth.gr/BFCrawler.rar.

Minimum server requirements to run the crawler:

  1. Operating System: Windows Server 2008 64-bit
  2. CPU: 2 Xeon CPUs, 2.5 GHz
  3. RAM: 4 GB
  4. Hard Disk: 20 GB (SAS) plus 60 GB (SATA)

Anyone interested in blog crawling is welcome to test the prototype and contact us to discuss further requirements and usage.

Figure 2 - The downloader file: contains download and installation instructions for the BlogForever Prototype Spider

2nd BlogForever Consortium Meeting

7:48 pm in Blog, News by Vangelis Banos

The 2nd BlogForever Consortium Meeting took place on 8-9 September in Thessaloniki, Greece. Nineteen participants from twelve institutions came to Thessaloniki to discuss BlogForever. Current progress was evaluated and the project roadmap was laid down.

The meeting was organized in sessions covering all aspects of the project:

  • Weblog Structure and Semantics (WP2) was one of the main sessions of the meeting, covering the recently submitted BlogForever Survey and the pending Blog Data Model.
  • The BlogForever Policies (WP3) session covered work on risk management as well as the Preservation Policy.
  • In the BlogForever software platform (WP4) session, work on User Requirements & Platform Specifications was evaluated. Additionally, a special technical session explored possible ways of designing & developing the BlogForever Platform.
  • Last but not least, the dissemination plan & associated activities were presented in the Dissemination & Exploitation (WP6) session.

Besides BlogForever partners, Carolyn Hank was also invited to present her work on Blog Preservation and contribute to expanding the spectrum of the project.


The BlogForever survey is live!

3:17 pm in Blog, News by Silvia Arango-Docio

After weeks of design work, the BlogForever survey is live, available in 6 languages and running for 28 days. The results of the survey, available at the end of the summer, will help us to develop digital preservation, management and dissemination facilities for weblogs within the BlogForever project. Hence, we are keen to gather information from you about blog content, context and usage patterns of current weblogs, so that we can identify your views on the long-term preservation, management, analysis, access and future use of the BlogForever Archive. We would appreciate it if you could take part in the survey using the following link:

[Survey banner image]

Thanks for participating!

Kick off Meeting

7:31 pm in News by Vangelis Banos

The kick-off meeting will be held in Warwick, UK on the 22nd and 23rd of March 2011 and the administrative workshop (for those who need it) will be held on the 21st of March.

About BlogForever

1:31 pm in News by Vangelis Banos

BlogForever will develop robust digital preservation, management and dissemination facilities for weblogs. These facilities will be able to capture the dynamic and continuously evolving nature of weblogs, their network and social structure, and the exchange of concepts and ideas that they foster: pieces of information omitted by current Web Archiving methods and solutions.