Hi moderators!
Our submission ID is 9726359. Could you please show the error logs? Thank you!
Zidan
Created by polalawang
Hi @vchung
Thank you so much for your reply! This helps a lot!
Zidan
@polalawang ,
Here are the errors (and warnings) for submission ID 9726450:
```
Feature names seen at fit time, yet now missing:
- project_A
- project_B
- project_C
- project_D
- project_E
- ...
warnings.warn(message, FutureWarning)
Traceback (most recent call last):
File "/usr/local/bin/script.py", line 91, in
merged["probability"] = model.predict_proba(X)[:,1]
File "/usr/local/lib/python3.9/site-packages/sklearn/ensemble/_forest.py", line 874, in predict_proba
X = self._validate_X_predict(X)
File "/usr/local/lib/python3.9/site-packages/sklearn/ensemble/_forest.py", line 605, in _validate_X_predict
X = self._validate_data(X, dtype=DTYPE, accept_sparse="csr", reset=False)
File "/usr/local/lib/python3.9/site-packages/sklearn/base.py", line 600, in _validate_data
self._check_n_features(X, reset=reset)
File "/usr/local/lib/python3.9/site-packages/sklearn/base.py", line 400, in _check_n_features
raise ValueError(
ValueError: X has 9724 features, but RandomForestClassifier is expecting 9732 features as input.
```
A small reminder regarding the warning: the test set does not contain the same project IDs as the training data (see similar discussion posts [here](https://www.synapse.org/#!Synapse:syn26133770/discussion/threadId=9522&replyId=28168) and [here](https://www.synapse.org/#!Synapse:syn26133770/discussion/threadId=9608&replyId=28447)).
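For anyone hitting the same `ValueError`, here is a minimal sketch of one common way to keep the feature count stable when the test set has different project IDs: reindex the encoded test frame to the columns seen at training time. The column name `project`, the toy data, and the helper `align_to_training_columns` are all hypothetical, not taken from the submitted script. With a scikit-learn (1.0 or later) estimator fitted on a DataFrame, the training columns should also be available as `model.feature_names_in_`.

```python
import pandas as pd


def align_to_training_columns(X_test, training_columns):
    """Give X_test exactly the training columns, in the training order.

    Columns unseen at fit time are dropped; columns missing from the
    test data are added and filled with 0.
    """
    return X_test.reindex(columns=training_columns, fill_value=0)


# Toy data (hypothetical): the test set contains a project ID ("Z")
# that never appears in training, and lacks one that does ("B").
train = pd.DataFrame({"project": ["A", "B"], "age": [30, 40]})
test = pd.DataFrame({"project": ["Z", "A"], "age": [25, 35]})

# One-hot encoding yields different columns for each frame.
X_train = pd.get_dummies(train, columns=["project"])
X_test = pd.get_dummies(test, columns=["project"])

# After alignment the test frame matches the training layout.
X_test_aligned = align_to_training_columns(X_test, X_train.columns)
```

Rows whose project ID was never seen in training end up with zeros in every project column, so whether that is acceptable depends on how much the model relies on those features.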
As for your follow-up questions:
> Also, if our script and model can pass the Fast Lane check, why is it not valid in the validation portal?
The Fast Lane queue mounts as its input a subset (sampled without replacement) of the training data. The Organizers did their best to emulate the unseen test set for the Fast Lane, but as you have observed, a valid submission in the Fast Lane does not always guarantee success in the Task 1 and Task 2 queues.
> if the script passes the Fast Lane check, does it mean the script is suitable for official submissions?
Unfortunately, only the scored submissions from the Task 1 and 2 queues will be considered.
Hi @vchung
We ran into trouble once again. Could you show the error logs for submission 9726450, please?
Also, if our script and model can pass the Fast Lane check, why is it not valid in the validation portal?
In addition, if the script passes the Fast Lane check, does it mean the script is suitable for official submissions?
Thank you again for your help!
Zidan
Thank you!
@polalawang ,
Here you go!
```
Traceback (most recent call last):
File "/usr/local/bin/script.py", line 54, in
metadata['converted_age'] = metadata['age'].apply(convert_age)
NameError: name 'metadata' is not defined
```
Hi @vchung
Thanks again for your help! We updated our script but it is still invalid. Could you show the error logs once more, please? The submission ID is 9726424.
Thank you so much in advance!
Zidan
@polalawang ,
Your recent submission has the following:
```
Traceback (most recent call last):
File "/usr/local/bin/script.py", line 96, in
merged["probability"] = model.predict_proba(X)[:,1]
File "/usr/local/lib/python3.9/site-packages/sklearn/ensemble/_voting.py", line 409, in predict_proba
self._collect_probas(X), axis=0, weights=self._weights_not_none
File "/usr/local/lib/python3.9/site-packages/sklearn/ensemble/_voting.py", line 384, in _collect_probas
return np.asarray([clf.predict_proba(X) for clf in self.estimators_])
File "/usr/local/lib/python3.9/site-packages/sklearn/ensemble/_voting.py", line 384, in
return np.asarray([clf.predict_proba(X) for clf in self.estimators_])
File "/usr/local/lib/python3.9/site-packages/sklearn/ensemble/_weight_boosting.py", line 878, in predict_proba
decision = self.decision_function(X)
File "/usr/local/lib/python3.9/site-packages/sklearn/ensemble/_weight_boosting.py", line 760, in decision_function
X = self._check_X(X)
File "/usr/local/lib/python3.9/site-packages/sklearn/ensemble/_weight_boosting.py", line 84, in _check_X
return self._validate_data(
File "/usr/local/lib/python3.9/site-packages/sklearn/base.py", line 600, in _validate_data
self._check_n_features(X, reset=reset)
File "/usr/local/lib/python3.9/site-packages/sklearn/base.py", line 400, in _check_n_features
raise ValueError(
ValueError: X has 9718 features, but AdaBoostClassifier is expecting 9737 features as input.
```
Hope this helps!
Hi @vchung
Thank you for your help! Could you also show the error logs for our new submission 9726381?
Thank you so much!
Zidan
Hi @polalawang ,
Submission ID 9726359 received the following error:
```python
Traceback (most recent call last):
File "/usr/local/bin/script.py", line 61, in
metadata['converted_age'] = metadata['age'].apply(convert_age)
File "/usr/local/lib/python3.9/site-packages/pandas/core/series.py", line 4433, in apply
return SeriesApply(self, func, convert_dtype, args, kwargs).apply()
File "/usr/local/lib/python3.9/site-packages/pandas/core/apply.py", line 1088, in apply
return self.apply_standard()
File "/usr/local/lib/python3.9/site-packages/pandas/core/apply.py", line 1143, in apply_standard
mapped = lib.map_infer(
File "pandas/_libs/lib.pyx", line 2870, in pandas._libs.lib.map_infer
File "/usr/local/bin/script.py", line 25, in convert_age
if age < 18:
UnboundLocalError: local variable 'age' referenced before assignment
```
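An `UnboundLocalError` like this usually means `age` is only assigned inside a branch that did not run for some input value. The actual `convert_age` is not shown in this thread, so the following is only a sketch under assumed inputs (numbers, numeric strings, or missing values) and with hypothetical output labels. The point is that `age` is bound on every path before it is compared.

```python
import math


def convert_age(raw_age):
    """Map a raw age value to a coarse label (labels are hypothetical)."""
    age = None  # bind the name up front so every path below can read it
    try:
        age = float(raw_age)
    except (TypeError, ValueError):
        # None, "n/a", and other unparsable values fall through as missing
        pass

    if age is None or math.isnan(age):
        return "unknown"
    if age < 18:
        return "child"
    return "adult"
```

With this shape, `metadata['age'].apply(convert_age)` should no longer raise on values the training data did not contain, although the fallback label still needs to be one the model was trained with.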
Good luck!