Chris Pollett > Students >
Amith

    ( Print View )

    [Bio]

    [Project Blog]

    [CS297 Proposal]

    [Del 1: OPIC Algorithm implementation]

    [Del 2: SALSA Algorithm & Nutch]

    [Del 3: Nutch implementation]

    [Del 4: HITS Algorithm implementation]

    [CS297 Report - PDF]

    [CS298 Proposal]

    [CS298 Final Report - PDF]

    [CS298 Project Source Code - ZIP]

                          

























Deliverable 1


Figure 1: Snapshot of OPIC Algorithm demo

This deliverable simulates the working of the Online Page Importance Calculation (OPIC) Algorithm. This tool was developed using Javascript and uses 10 * 10 static nodes to depict the working of a static network. A connection of nodes is activated / deactivated by selecting 1/0 on the adjacency matrix.

The basic working of OPIC algorithm can be defined as: Initially, some cash is distributed to each page and each page when it is crawled distributes its current cash equally to all pages it points to. Cash can be defined as the numerical value allotted to each page. The static nodes of the matrix represent the web pages. This is recorded in the history of the page. The importance of a page is then obtained from the credit history of the page. The idea is that the flow of cash through a page is proportional to its importance. At each step, an estimate of any page k's importance is (H[k]+C[k]) / (G+1), where H[k] represents history, C[k] represents cash and G is the total cash accumulated. The algorithm is executed over decided number of iterations until the acceptable rate of convergence is achieved.

OPIC simulation tool

    OPIC Javascript tool