• sqgl@beehaw.org
      link
      fedilink
      arrow-up
      1
      ·
      edit-2
      3 days ago

      That does not answer my question.

      No I don’t trust their methodology. In fact I am suspicious the numbers are entirely fictitious. I imagine they are highly guarded confidential information which is not public.

      • jarfil@beehaw.org
        link
        fedilink
        arrow-up
        2
        ·
        edit-2
        2 days ago

        You asked where do they get the data from… well, that’s the answer 🤷

        The numbers could be fictitious (you didn’t ask whether they get “reliable data”), or they could be doctoring them themselves… but there is a number of companies whose work is to let sites put trackers that gather user data, so they can in turn use it as a point when luring advertisers.

        It isn’t “highly guarded confidential” information, websites would happily submit their access logs if that could make them look more appealing to advertisers… but they don’t, because: a) they could be sending fake data, which would make the aggregating company lose face, meaning they won’t accept self-reported data, and b) site logs contain a lot of users’ personal information, sharing which could fall afoul of privacy legislation.

        They may still have to pay for access to parse that data, or extract it from the data made publicly available (…which could still be doctored, but 🤷)

        • sqgl@beehaw.org
          link
          fedilink
          arrow-up
          1
          ·
          edit-2
          2 days ago

          they can in turn use it as a point when luring advertisers.

          Wouldn’t that be shared only with potential advertisers upon request via a password rather than just making it publicly available?

          I am only speculating, are you?

          • jarfil@beehaw.org
            link
            fedilink
            arrow-up
            1
            ·
            2 days ago

            The intermediary companies also want to attract clients, whom to sell more detailed data. It isn’t unusual for them to release basic data like total number of users per domain, for free. For further segmentation, like interests, keywords, geolocation, client’s system properties, etc. they do require subscribing.

            I’m not speculating, I’ve had a chance to work at server maintenance (where basic data comes from), website design and maintenance (where 3rd part user trackers go), and both offering ad space and contracting ad services (dealing with these companies, ad networks, and website owners).