ESA Datalabs Ariel Hackathon 2025

Task

Participants will collaborate in teams (4 - 5 people) to devise machine learning-driven solutions aimed at converting noisy observations (referred to as lightcurves) into transmission spectra. The objective is to approximate the ground truth spectra as closely as achievable. The winning team(s) will be announced as the victor(s) of the hackathon and will receive a prize.

Evaluation

The final winner will be determined by the highest score achieved after passing the solution to the score calculator. Check the Scoring Metrics tab for more information. The winner will be announced on the last day of the hackathon soon after the the submission deadline

Model Restrictions

There is no restriction on the models, algorithms or data preprocessing techniques, neither on the programming languages, environments or tools used for your implementation. You are also free to use data augmentation techniques, pretrained models or any prior domain knowledge not included in the provided dataset. Finally, you are free to choose your own way of splitting the training data between training and validation sets and to use as many of the provided datapoints or features as you wish – or can handle

The ML task to be solved is a supervised learning one, and more particularly a multi-targets regression problem.

Features:

Features: Each training datapoint consists of a set of 55 noisy light curves (one per wavelength, each being a timeseries of 300 timesteps) and a set of 6 additional stellar and planetary parameters. All these are real numbers. For more details on what a light curve is and what we are modelling, go to the Science page.

Targets:

The goal is to predict a set of 55 real values (relative radii, one per wavelength) for any given datapoint (lightcurve).

Event Rules

1. Each team must have a unique alias.
2. The organisers will use the provided code, models and solutions only for the purposes of checking whether the contest rules (4) & (5) are not broken. The organisers will not use any result without the authors’ permission.
3. Participants must not use TauREx 3 or similar atmospheric retrieval codes to retrieve the test-set forward models. This would be considered test-set leakage and against the spirit of the competition.
4. Participants must not have access to the heldout testset's ground truth before the competition’s closing date. If they do, they will be disqualified.
5. For an entry to count as a winning entry, it must not rely heavily (as judged by the organisers) on hard-coded elements that are solely deemed to be due to test-set leakage.
6. The organisers reserve the right to interpret and enforce the above rules at their discretion to maintain a respectful and inclusive environment for all participants.

In the case of a draw

Should two or more top-ranked participants have the same final score, we will require the participants to submit their algorithms. This is to check that no plagiarism has occured.

Reasons for disqualification

In case of plagiarism, forced test set leakage, failure to produce a description of the solution when requested or to conform with the multiple accounts or the team submissions policies, a participant will be disqualified and the next-in-rank participant will be considered in their place.

To evaluate a new solution (model), the participants are requested to upload its predictions on the test dataset via the score calulcator page. See the Data Formats page for information on the upload format. A score will then be automatically calculated for the solution.

The score reported is based on the average of the absolute error per target (i.e. on the relative radii) across all test set examples $i$ and all wavelengths $j$ and is given by:

$$Score = 10^4 - \frac{\sum_{i \in Test} \sum_{j=1}^{55} w_{ij} 2 y_{ij} |\hat{y}_{ij} - y_{ij}|}{\sum_{i \in Test} \sum_{j=1}^{55} {w_{ij}}}10^6,$$

where $$y_{ij}$$ is the true relative radius and $$\hat{y}_{ij}$$ the predicted relative radius of the $$$j$-th$$ wavelength of the $$$i$-th$$ test set example. $$w_{ij}$$ is treated as uniform in this hackathon, meaning every wavelengths and test examples are treated as equally weighted.

The higher the score, the better your ranking. The maximum achievable score is 10000. The score is not lower-bounded (i.e. can be negative), but reasonable attempts (e.g. predicting the average target value for all test datapoints) should not produce scores below 4000.