Skip to content

Latest commit

 

History

History
217 lines (133 loc) · 7.96 KB

reports.md

File metadata and controls

217 lines (133 loc) · 7.96 KB

Reports

A report is a collection of similar metrics. Reports may span multiple crawls, in which case the data is visualized as a timeseries. Reports may also focus on one snapshot of a particular crawl, in which case the data is visualized as a histogram.

Configuring a Report

Reports are configured in config/reports.json. Here's a small sample configuration:

{
  "_reports": [
    "foo",
    "bar"
  ],
  "_lens": {
    "wordpress": {
      "name": "WordPress"
    }
  },
  "_featured": [
    "bar"
  ],
  "_metrics": {
    "x": {
      "name": "X Metric",
      "type": "KB"
    },
    "y": {
      "name": "Y Metric",
      "type": "ms"
    },
    "z": {
      "name": "Z Metric",
      "type": "Requests",
      "redundant": true
    }
  },
  "foo": {
    "name": "Foo",
    "summary": "Lorem ipsum",
    "metrics": [
      "x",
      "y"
    ]
  },
  "bar": {
    "name": "Bar",
    "summary": "Lorem ipsum",
    "minDate": "2017_06_01",
    "maxDate": "2017_08_15",
    "metrics": [
      "x",
      "z"
    ]
  }
}

In this example config, there are two reports: Foo and Bar. They both include X Metric, but only Bar is featured on the home page. The Bar report is also date-capped between June 1, 2017 and August 15, 2017, meaning that the report UI will not allow the user to browse metrics outside of this time window.

Manifest Structure

  • reports.json

    JSON-encoded object mapping report IDs to report configs. Also includes metadata whose property names are preceded by an underscore.

    • _reports

      Required array of report IDs. Defines the sequence of reports as they appear on the reports page.

    • _featured

      Required array of report IDs. Defines the sequence of reports as they appear on the home page. This array should only contain 1-3 reports.

    • _lens

      Optional object mapping lens IDs to lens configs. Defines the available lenses as they appear on the report pages.

      • lens ID

        Short identifier string for the lens. Used as the URL value of the "lens" querystring parameter on report pages. Must be unique and match the lens/* SQL filenames, eg "wordpress".

      • lens config

        Maps lens configuration property names to values. The following properties are available:

        • name

          Required string. Human-readable name of the lens, eg "WordPress". Used in the report filtering UI.

    • _metrics

      Required object mapping metric IDs to metric configs.

      • metric ID

        Short identifier string for the metric. Used as the URL search fragment for the chart on the report page. Must be unique and match the histogram/timeseries SQL filenames. May be reused between reports.

      • metric config

        Maps metric configuration property names to values. The following properties are available:

        • name

          Required string. Human-readable name of the metric, eg "Time to First Paint". Must be unique to the report. Used in the title of the charts.

        • type

          Required string. Human-readable units of measurement. Common examples: "ms", "KB", "Requests". Used to label the chart axis and in chart tooltips.

        • singular

          Optional string. Human-readable units of measurement when the value is equal to 1. Common examples: "second", "Request", "library". Used to label the chart axis and in chart tooltips.

        • description

          Optional string. Human-readable explanation of what the metric is measuring and how to interpret the results.

        • downIsBad

          Optional boolean. Indicates that the metric going down is a bad thing. Defaults to false. For example all timing metrics should be as low (fast) as possible. Other metrics, like percent of HTTPS adoption, should be increasing, so this config should be set to true.

        • downIsNeutral

          Optional boolean. Indicates that the metric going down is not necessarily either good nor bad. Defaults to false.

        • histogram

          Optional object. Includes histogram-specific configuration options.

          • minDate

            Optional string. The earliest date at which the metric is available. YYYY_MM_DD format. For example, any metric that depends on the har dataset must have a minDate value of at least 2016_01_01, which is when the first HAR table became available.

          • maxDate

            Optional string. The latest date at which the metric is available. YYYY_MM_DD format.

          • enabled

            Optional boolean. Default true. Whether the metric should be included in histogram reports.

        • timeseries

          Optional object. Includes timeseries-specific configuration options.

          • enabled

            Optional boolean. Default true. Whether the metric should be included in timeseries reports.

          • fields

            Optional array. Default ["p10", "p25", "p50", "p75", "p90"]. Defines the field names of the measurement objects that should be plotted. A measurement object is just an element in the array of JSON-encoded timeseries data, containing client, date, and timestamp info as well as the measurement data.

          • redundant

            Optional boolean. Default false. Whether the metric type is redundant with the metric name. For example, set to true when the type is included in the name, like "Total Requests". The type will be omitted anywhere it follows the metric name.

        • wpt

          Optional object. Includes config info for extracting WebPageTest results.

          • path

            Required string. The '$'-delimited object path indicating where to find the metric in raw WebPageTest results.

          • scale

            Optional number. Multiplier for scaling a WebPageTest result to match the units used by HTTP Archive. For example, to convert bytes to KB, a scale of 0.001 divides the value by 1000.

    • report ID

      Short identifier string for the report. Used as the URL fragment for the report page. For example, the JavaScript report with ID js would be accessed at /reports/js. Changing this value will probably break permalinks.

    • report config

      Maps report configuration property names to values. The following properties are available:

      • name

        Required string. Human-readable title of the report. Must be unique.

      • summary

        Required string. Human-readable description of the report. Suggested length: 1-5 sentences.

      • metrics

        Required array. Describes the metrics included in the report.

      • minDate

        Optional string. The earliest date at which the report is available. YYYY_MM_DD format.

      • maxDate

        Optional string. The latest date at which the report is available. YYYY_MM_DD format.

      • datePattern

        Optional string. Regular expression pattern of dates for which the report is available. For example, ".*_01$" matches only the first crawl of the month.

      • maxDateMetric

        Optional string. Metric ID used to query the GCS bucket. The maxDate for the report will be the most recent date-bucket that contains this metric's JSON results.

      • view

        Optional string. Specifies the layout of metrics. For example, setting this to "grid" will force the report into grid view by default. The default value is "list".

Rendering a Report

The main.py web server config file loads dates.json and reports.json into memory on startup and refreshes their contents according to the MAX_REPORT_STALENESS constant, which is set to 3 hours. This helps to limit file IO on each request.

If there is only one distinct date, the server renders the report with the histogram template, otherwise it uses the timeseries template.

The report scripts will build the CDN URL based on the metric name and optional date. After fetching the JSON data, it will be massaged into a consumable format for Highcharts and a plain data table.