[Neo] Idea for project using neo4j - good idea?

Matt Brown msbcode at gmail.com
Mon Feb 8 21:40:00 CET 2010


Hi Peter, thanks for responding to my email. This example actually looks
exactly like what I was thinking about - I guess I'm not the only one with
such a basic idea :) Thanks again for the input!

On Mon, Feb 8, 2010 at 3:10 PM, Peter Neubauer <
peter.neubauer at neotechnology.com> wrote:

> Matt,
> take a look at the Site Usage Analytics example on the Neo4j Wiki,
> http://wiki.neo4j.org/content/Site_Usage_Analytics for some
> inspiration of possible node space layout. Would that be in the
> direction you are thinking?
>
> Cheers,
>
> /peter neubauer
>
> COO and Sales, Neo Technology
>
> GTalk:      neubauer.peter
> Skype       peter.neubauer
> Phone       +46 704 106975
> LinkedIn   http://www.linkedin.com/in/neubauer
> Twitter      http://twitter.com/peterneubauer
>
> http://www.neo4j.org                - Your high performance graph
> database.
> http://gremlin.tinkerpop.com    - The terminal to the Giant Global Graph.
>
>
>
> On Mon, Feb 8, 2010 at 4:13 PM, Matt Brown <msbcode at gmail.com> wrote:
> > Hi,
> >
> > I have an idea for a project for which I was considering using neo4j to
> > implement.
> >
> > Given several hundred http request logs (in NCSA format), I'd like to
> build
> > something that generates statistics on the paths that users take through
> the
> > website, something like "from page /a.htm, 50% of visitors proceed to
> > /b.htm, 13% proceed to /c.htm" - something similar to the data that
> Google
> > Analytics gives you. I'd also like to be able to trace the path that an
> > arbitrary SessionID took through the site, to recreate his/her visit.
> >
> > I don't have any actual experience with neo4j or graph databases, but I'm
> > thinking that I could model the various pages in the site as nodes and
> model
> > the path that various visitors took through the site as relationships
> > between the nodes. I'm just curious if anyone more familar with neo4j
> could
> > confirm if this would be a good use for it or not?
> >
> > I'm also wondering what would be the best way to store the paths that
> > individual users/sessionIDs took through the page nodes. If I have a
> million
> > unique sessions across all of my log files, will neo4j be able to support
> > having tens of thousands of relationships between page A and B? What
> would
> > be the best way to store the order of page "nodes" that a particular
> visitor
> > visited - would make this sense to add as attributes of each unique
> > relationship?
> >
> > And if I model things this way, will I be able to effectively search for
> the
> > two use cases I mentioned above (percentage breakdown of what pages were
> > visited next from page N, and the path through the page nodes that an
> > individual session took)?
> >
> > Thanks
> > Matt
> > _______________________________________________
> > Neo mailing list
> > User at lists.neo4j.org
> > https://lists.neo4j.org/mailman/listinfo/user
> >
> _______________________________________________
> Neo mailing list
> User at lists.neo4j.org
> https://lists.neo4j.org/mailman/listinfo/user
>


More information about the User mailing list