'Similar artists' data for musical artists

  1.  
    1. Hi there, 

      I have a data set with for about 38,000 artists with a MusicBrainz ID a list of the artists that are most related to it according to last.fm.
      I have about 60,000 more without a MusicBrainz ID and I can obviously retrieve more data through the last.fm webservices.

      My plan is to add all artists with a MusicBrainz ID that are more than 80% similar to an artist (about two or three artists, usually) as a 'similar artist' relation. Is that ok?

      As an example, I added links for about 100 artists to the sandbox. I do my lookups based on the MusicBrainz ID, and do not create new artists. See: Édith Piaf

      Any comments? Should I somehow link the artists to their last.fm page after processing? Add their last.fm urlname as a key, perhaps?

      Thanks,

      - Jeroen

      1. Thanks, Jeroen. First, please be sure that the data can be contributed legally; we did not collect the artist similarity information from MusicBrainz, for instance, because it is licensed CreativeCommons-Non-Commercial, and as a commercial enterprise, we can’t legally use that data.

        If you have permission to load that data, then this sounds like a great idea! You could also add the last.fm page as a Web link. For lookup, since last.fm and Freebase both use MusicBrainz keys, a key is probably not needed

      2. Ugh, right, I forgot to check the NonCommercial part. Never mind...



    Discussion is posted in:

    Think this discussion also relates to something else? Cross-post it by adding a new discussion area: