Hi @vchung, could you tell me the error message for submission 9723697? Thank you in advance!
Created by yaojm
Hello,
Sure, please see the error below:
/submission/run_model_v3.py:68: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead
See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
metadata['race'] = clean_race_cat(metadata['race'])
Traceback (most recent call last):
File "/submission/run_model_v3.py", line 179, in <module>
main()
File "/submission/run_model_v3.py", line 158, in main
large_df = large_df.drop(['CST_IV-A', 'score:CST_IV-A'], axis=1) # drop 'CST_IV-A', 'score:CST_IV-A'
File "/usr/local/lib/python3.10/site-packages/pandas/util/_decorators.py", line 311, in wrapper
return func(*args, **kwargs)
File "/usr/local/lib/python3.10/site-packages/pandas/core/frame.py", line 4954, in drop
return super().drop(
File "/usr/local/lib/python3.10/site-packages/pandas/core/generic.py", line 4267, in drop
obj = obj._drop_axis(labels, axis, level=level, errors=errors)
File "/usr/local/lib/python3.10/site-packages/pandas/core/generic.py", line 4311, in _drop_axis
new_axis = axis.drop(labels, errors=errors)
File "/usr/local/lib/python3.10/site-packages/pandas/core/indexes/base.py", line 6644, in drop
raise KeyError(f"{list(labels[mask])} not found in axis")
KeyError: "['CST_IV-A', 'score:CST_IV-A'] not found in axis"
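The `KeyError` above is raised because `DataFrame.drop` by default requires every listed label to exist in the frame. A minimal sketch of a tolerant drop, assuming the two columns may or may not be present in the evaluation data (the small frame and its `feature_a`/`feature_b` columns are made up for illustration and are not from the submission):

```python
import pandas as pd

# Hypothetical stand-in for large_df; the real frame has thousands of columns.
large_df = pd.DataFrame({"feature_a": [0.1, 0.2], "feature_b": [1, 0]})

to_drop = ["CST_IV-A", "score:CST_IV-A"]

# errors="ignore" skips labels that are absent instead of raising KeyError.
large_df = large_df.drop(columns=to_drop, errors="ignore")

print(list(large_df.columns))
```

This only avoids the exception at the `drop` call; whether the model itself still needs those columns is a separate question.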
Hope this helps.
Thanks
Aarthi
@aarthikr Thank you so much! And could you please check on submission 9723796 as well?
Hi Jiaming,
Below is the error I see -
/submission/run_model_v3.py:68: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead
See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
metadata['race'] = clean_race_cat(metadata['race'])
Traceback (most recent call last):
File "/submission/run_model_v3.py", line 179, in <module>
main()
File "/submission/run_model_v3.py", line 158, in main
large_df = large_df.drop(['CST_IV-A', 'score:CST_IV-A'], axis=1) # drop 'CST_IV-A', 'score:CST_IV-A'
File "/usr/local/lib/python3.10/site-packages/pandas/util/_decorators.py", line 311, in wrapper
return func(*args, **kwargs)
File "/usr/local/lib/python3.10/site-packages/pandas/core/frame.py", line 4954, in drop
return super().drop(
File "/usr/local/lib/python3.10/site-packages/pandas/core/generic.py", line 4267, in drop
obj = obj._drop_axis(labels, axis, level=level, errors=errors)
File "/usr/local/lib/python3.10/site-packages/pandas/core/generic.py", line 4311, in _drop_axis
new_axis = axis.drop(labels, errors=errors)
File "/usr/local/lib/python3.10/site-packages/pandas/core/indexes/base.py", line 6644, in drop
raise KeyError(f"{list(labels[mask])} not found in axis")
KeyError: "['CST_IV-A', 'score:CST_IV-A'] not found in axis"
Hope this helps.
Thanks
Aarthi
Hi @albrecht,
Thank you so much for your last reply! I tried another submission with the two columns mentioned in the error log removed, but it still does not work. Could you disclose the error log for submission 9723796? Many thanks!
Best,
Jiaming Yao
Hi @yaojm, the error I see is
```
Feature names seen at fit time, yet now missing:
- CST_IV-A
- score:CST_IV-A
File "/submission/run_model_ept_adaboost_v2.py", line 177, in <module>
File "/submission/run_model_ept_adaboost_v2.py", line 160, in main
Adaboost_pred = model_ept.predict_proba(X_test)[:,1] # prob of early preterm birth
...
ValueError: X has 9739 features, but AdaBoostClassifier is expecting 9741 features as input.
```
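It looks like the model was fit with two columns (`CST_IV-A` and `score:CST_IV-A`) that are missing from the test set, so `X_test` has 9739 features instead of the 9741 the classifier expects.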
-Jake
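One way to guard against this kind of mismatch is to align the test frame to the columns used at fit time before calling `predict_proba`. A minimal sketch, assuming the training column names are available as a list (`fit_columns` is a hypothetical name; scikit-learn estimators fitted on a DataFrame also expose them as `feature_names_in_`) and assuming that filling an absent feature with 0 is acceptable for the model, which may not hold for this submission:

```python
import pandas as pd

# Hypothetical column list recorded at fit time (the real one has 9741 entries).
fit_columns = ["feature_a", "CST_IV-A", "score:CST_IV-A", "feature_b"]

# Hypothetical test frame that lacks the two CST_IV-A columns.
X_test = pd.DataFrame({"feature_b": [1, 0], "feature_a": [0.1, 0.2]})

# Report which fit-time features are absent so the gap is visible in the log.
missing = [c for c in fit_columns if c not in X_test.columns]
print("missing at predict time:", missing)

# reindex restores the fit-time column order, adds absent columns
# filled with 0 (an assumption), and drops any extra columns.
X_aligned = X_test.reindex(columns=fit_columns, fill_value=0)
```

Logging `missing` before filling keeps the problem visible instead of silently masking it. If a zero fill is not meaningful for these features, retraining the model without them is the safer route.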