
Guardoni documentation

Guardoni is a free-software automated web scraper used to collect data about platforms such as YouTube and TikTok. It is a Puppeteer-based tool that loads the Tracking Exposed extensions into its browser. The data gathered from the platforms is copied into our database for later retrieval or analysis.

๐Ÿค Getting helpโ€‹

Having trouble? We’d like to help!

  • Feel free to ask questions by email.

  • Check the GitHub repo, report bugs, or contribute. ❤️

  • Join the community in Tracking Exposed Slack.

  • Send a Tweet

🛠 Disclaimer

Guardoni is still in the alpha stage. The software has not yet been thoroughly tested by the developers and may contain serious errors; any resulting instability could cause crashes or data loss. Alpha software may also lack features that are planned for the final version. Free software projects often make alpha versions publicly available.

👣 First steps

  • Guardoni at first sight

    Understand what Guardoni is and how it can help you.

  • Installation guide

    Get Guardoni installed on your computer.

  • Coming soon: Guardoni Tutorial

    Take part in our first experiment and learn how to create your first project.

  • Coming soon: Examples and ready-made experiments

    Learn more by playing with a pre-made Tracking Exposed experiment.

🔬 How we collect data

Guardoni is not directly responsible for collecting data while browsing: put simply, it is just a wrapper around Puppeteer with some automation features. Data scraping and collection are done by our browser extensions, which Guardoni loads at bootstrap.
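To illustrate what "loading an extension at bootstrap" can look like with Puppeteer, here is a minimal sketch. It is not Guardoni's actual code: the helper names `buildExtensionArgs` and `buildLaunchOptions`, the `LaunchOptions` shape, and the `./extension` path are assumptions made for this example. The two Chromium flags are the ones commonly used to load an unpacked extension.

```typescript
// Hypothetical helper (not part of Guardoni): returns the Chromium flags
// that load a single unpacked extension from `extensionDir`.
function buildExtensionArgs(extensionDir: string): string[] {
  return [
    // Disable every extension except the one we want to load.
    `--disable-extensions-except=${extensionDir}`,
    // Load the unpacked extension found in this directory.
    `--load-extension=${extensionDir}`,
  ];
}

// Hypothetical, reduced option shape for this sketch; Puppeteer's real
// launch options accept many more fields.
interface LaunchOptions {
  headless: boolean;
  args: string[];
}

// Assumption: the browser runs with a visible window, since extensions
// have traditionally required a non-headless Chrome.
function buildLaunchOptions(extensionDir: string): LaunchOptions {
  return { headless: false, args: buildExtensionArgs(extensionDir) };
}

// With Puppeteer installed, the options would be used roughly like this:
//   import puppeteer from "puppeteer";
//   const browser = await puppeteer.launch(buildLaunchOptions("./extension"));
```

Keeping the flag construction in a pure function makes it easy to check without starting a browser. The wrapper only opens the browser and drives it, while the extension does the collecting.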

🧪 Experiments

Experiments are sets of instructions prepared to perform a series of actions. They are CSV files with a list of URLs and watching times. They belong to an exploratory phase, giving us a common basis on which to work and compare differences.
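As a rough illustration of such a file, the sketch below parses a small CSV into a list of steps. The column names `url` and `watchSeconds`, the `ExperimentStep` type, and the `parseExperiment` function are assumptions made for this example; the real Guardoni experiment format may use different headers and units.

```typescript
// Hypothetical shape of one experiment step: a URL to open and how long to watch it.
interface ExperimentStep {
  url: string;
  watchSeconds: number;
}

// Parses a CSV with a header row containing the (assumed) columns
// `url` and `watchSeconds`. This is a naive parser: it splits on commas
// and does not handle quoted fields, so URLs containing commas will fail.
function parseExperiment(csv: string): ExperimentStep[] {
  const lines = csv
    .split(/\r?\n/)
    .map((line) => line.trim())
    .filter((line) => line.length > 0);

  const [headerLine, ...rows] = lines;
  if (headerLine === undefined) return [];

  const header = headerLine.split(",").map((h) => h.trim());
  const urlIdx = header.indexOf("url");
  const watchIdx = header.indexOf("watchSeconds");
  if (urlIdx === -1 || watchIdx === -1) {
    throw new Error("CSV must have 'url' and 'watchSeconds' columns");
  }

  return rows.map((row, i) => {
    const cells = row.split(",").map((c) => c.trim());
    const url = cells[urlIdx] ?? "";
    const rawWatch = cells[watchIdx] ?? "";
    // An empty cell must be rejected explicitly, because Number("") is 0.
    const watchSeconds = rawWatch === "" ? NaN : Number(rawWatch);
    if (url === "" || !Number.isFinite(watchSeconds) || watchSeconds < 0) {
      // i + 2: one for the header row, one for 1-based line numbering
      // (counted over non-empty lines only).
      throw new Error(`Invalid experiment row at line ${i + 2}`);
    }
    return { url, watchSeconds };
  });
}

// Example input, using placeholder URLs.
const exampleSteps = parseExperiment(
  "url,watchSeconds\nhttps://example.com/video-a,30\nhttps://example.com/video-b,5\n"
);
```

Each parsed step could then be handed to the automation layer, which would open the URL and wait for the given time before moving on.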

๐Ÿน Design principlesโ€‹

  • User-friendly. Guardoni should be easy to learn and use, so that users do not feel overwhelmed. It should feel intuitive and be easy to build on, using approaches users are already familiar with.

  • Optimal resource consumption. Guardoni should take a careful, cost-efficient approach to system resources. Efficiency is one of the critical factors that distinguish high-quality software.

  • Interoperability. Guardoni should use common data formats and communication protocols so that it can work with other products or systems.