brain-score · mschrimpf · Jan 2, 2024 · Nov 6, 2023 · Nov 8, 2023 · Dec 18, 2023
diff --git a/...sion/submission/brainscore_submission.png → .../source/modules/brainscore_submission.png b/...sion/submission/brainscore_submission.png → .../source/modules/brainscore_submission.png
diff --git a/docs/source/modules/submission.rst b/docs/source/modules/submission.rst
@@ -1,12 +1,119 @@
 Submission
 ----------
 
-The submission package implements automatic evaluation of models submitted on http://brain-score.org.
-It also implements standard wrappers which use model-tools_ to convert standard models from machine learning
-into brain models (i.e. implementing the model-interface_).
-
-.. automodule:: brainscore.submission
-    :members:
-    :undoc-members:
-.. _model-tools: https://github.com/brain-score/model-tools
-.. _model-interface: https://github.com/brain-score/brain-score/blob/master/brainscore/model_interface.py
+Submission System Components
+############################
+
+
+
+To provide an automatic scoring mechanism for artificial models of the brain, Brain-Score has implemented an entire
+ecosystem, with each part explained below:
+
+.. image:: brainscore_submission.png
+    :width: 200px
+    :align: center
+    :height: 100px
+    :alt: Overview of the Brain-Score submission system components
+
+- **Brain-Score Website**:
+
+    The `website <http://www.brain-score.org>`_ implementation, also found on
+    `GitHub <https://github.com/brain-score/brain-score.web>`_, handles our front end. It is implemented using Django
+    and also accesses the database instance. When users submit their models as a zip file, the website triggers
+    execution with an HTTP request to Jenkins (see below) and delivers the submitted zip file. Users can also
+    see their profile and personal (private) leaderboard, as well as the public leaderboard.
+
+    The website is hosted via Amazon Elastic Beanstalk. There are three instances on AWS EB:
+
+    - `brain-score-web-dev <http://brain-score-web-dev.us-east-2.elasticbeanstalk.com>`_: This is our dev website
+      environment. It is the same as prod, but is a retired branch.
+
+    - `brain-score-web-prod <http://brain-score-web-prod.us-east-2.elasticbeanstalk.com>`_: Our old (retired) legacy
+      branch of prod. This was used from 6/2020 to 6/2023, and was replaced by the updated environment below.
+
+    - `Brain-score-web-prod-updated <http://brain-score-web-prod-updated.kmk2mcntkw.us-east-2.elasticbeanstalk.com>`_:
+      Our current, live website. This is what United Domains uses to load the actual Brain-Score site.
+
+
+- **Jenkins**:
+
+    `Jenkins <http://braintree.mit.edu:8080/>`_ is a continuous integration tool, which we use to automatically run
+    project unit tests and the scoring process for models of the brain. Jenkins runs on Braintree, the DiCarlo lab's
+    internal server. Jenkins defines different jobs and executes different tasks. The task for a new submission is
+    triggered via the website, and the unit test tasks are triggered by GitHub webhooks. Once the jobs are triggered,
+    Jenkins runs a procedure to execute the tests or scoring and communicates the results back to the user (via email)
+    or back to GitHub.
+
+- **Openmind**:
+
+    Because scoring submissions is computationally and memory intensive, we cannot execute model scoring on small
+    machines. We submit jobs to Openmind, a computer cluster operated by MIT BCS. The big advantage of Openmind is its
+    queuing system, which allows detailed resource requirements to be defined. Jobs are executed once their requested
+    resources are available. Jenkins-related contents are stored in ``/om5/group/dicarlo/jenkins``. This directory
+    contains scripts for model submission (``score_model.sh``) and unit tests (``unittests_brainscore.sh``). The scripts
+    are executed in an Openmind job and are responsible for fully installing a conda environment, executing the process,
+    and shutting everything down again. Results are stored in the database or copied to Amazon S3 cloud storage.
+    From there, Jenkins reports the results back to its caller (the website) and emails the user.
+
+
+- **Postgres database**:
+
+    Our database, hosted on AWS, contains all displayed score and submission data, along with much more user and
+    model data/metadata. Our AWS account contains three database instances:
+
+    - Prod (brainscore-prod-ohio-cred): This database is used in production mode and contains real users' data. This
+      database should not be altered for development until features have been tested and vetted on Dev.
+    - Dev (brainscore-1-ohio-cred): A development database, which can be used to develop new database-dependent
+      features. Nothing will break when the database schema here is changed; it is periodically updated to match Prod.
+    - Test (brainscore-ohio-test): The database used for executing tests. Jenkins also executes unit tests of all
+      Brain-Score projects and should use this database for testing.
+
+    The names in parentheses are used in brain-score to load database credentials for the different databases.
+    Changing the name selects a different database. Databases are automatically snapshotted every 7 days, and
+    developers can restore snapshots at any time.
+
+
+
+
+
+What to do
+#################
+
+
+...when changing the database schema
+************************************
+The current schema is depicted `here
+<https://github.com/brain-score/brain-score/blob/master/brainscore/submission/db_schema.uml>`_.
+
+
+When the database schema has to be changed, use the `Brain-Score.web <https://github.com/brain-score/brain-score.web>`_
+project, along with Django commands, to adjust the tables (in ``benchmarks/models.py``). Once changes are made locally,
+see the `deployment guide <https://github.com/brain-score/brain-score.web/blob/master/deployment.md#to-deploy>`_ to
+apply those migrations to the correct databases. All needed changes to the database (dev or prod) should be made with
+Django via migrations. During development, work with the dev database (secret ``brainscore-1-ohio-cred``); when your
+tests pass on the test database (``brainscore-ohio-test``), they are ready for the PR. Once the PR is approved and the
+test cases pass, the PR can be merged. Finally, apply those migrations to the prod database via the link above.
+
+
+...when changing the submission process
+***************************************
+In addition to the main job for scoring submissions (``run_benchmarks``), Jenkins contains a second job
+(``dev_run_benchmarks``), which can be used to test new submission code. In that job, it is a good idea to check out
+your development branch rather than the Brain-Score master branch that the default job uses. This way you can run a
+whole submission without harming the "production" job. A duplicate of ``score_models.sh`` for dev, aptly named
+``dev_score_models.sh``, already accomplishes this; it is the script run in Jenkins's dev environment. Once the
+development job runs successfully, the code can be merged to master and will be run "in production".
+
+Scoring Process Description
+###########################
+For scoring submitted files, we install the Brain-Score framework on Openmind and run the scoring process. Two types
+of submission are possible:
+
+- First-time submissions, which provide a zip file with new models to score.
+- Resubmissions of already scored models, which should be scored on updated or new benchmarks.
+
+A resubmission requires only a list of model IDs as stored in the database. For new submissions, the delivered zip file is
+unpacked, its modules are installed, and the models are instantiated. The submitted modules must implement a clearly defined
+API, which is described in detail HERE. Once a submitted module is formatted correctly, the process can extract the models and
+score them. Results are stored in the Score table of the database and in a .csv file. To score old models on new benchmarks,
+the process installs (possibly multiple) past submission zip files and scores the models. Every submission and all scores are persisted in the database.