@eelstretching
Member

…rastructure

Description

The transformers performed poorly when there was a large number of features. I got the transformation time for a large text classification problem down from more than two hours to less than two minutes. Along the way, I added some logging (at Level.FINE) so you can see what's happening if you want to.
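
For anyone who wants to see the new Level.FINE messages, here is a minimal sketch of turning them on with `java.util.logging`. The class name `FineLogging` and the logger name passed in are illustrative assumptions, not names taken from this PR; substitute the logger name the transformer code actually uses.

```java
import java.util.logging.ConsoleHandler;
import java.util.logging.Level;
import java.util.logging.Logger;

// Hypothetical helper, not part of this PR.
public final class FineLogging {
    private FineLogging() {}

    /**
     * Lowers the named logger's threshold to FINE and attaches a console
     * handler at the same level. The default console handler on the root
     * logger filters at INFO, so changing the logger level alone is not enough.
     * The caller should keep the returned reference so the configured logger
     * is not garbage collected.
     */
    public static Logger enableFine(String loggerName) {
        Logger logger = Logger.getLogger(loggerName);
        logger.setLevel(Level.FINE);
        ConsoleHandler handler = new ConsoleHandler();
        handler.setLevel(Level.FINE);
        logger.addHandler(handler);
        return logger;
    }
}
```

A call such as `FineLogging.enableFine("org.tribuo.transform")` would do it, assuming that is the package the transformer loggers live under.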

As I was working on a text classification problem, I also added an IDFTransformation that can set TFIDF weights in the example.
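
For readers unfamiliar with the weighting, here is a rough, self-contained sketch of TF-IDF. It uses the textbook form idf = ln(N / df), which is an assumption and may differ from the exact formula in the IDFTransformation added here. The class `TfIdfSketch` and its methods are hypothetical and do not use the library's API.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration of TF-IDF weighting, not the code from this PR.
public final class TfIdfSketch {
    private TfIdfSketch() {}

    /** Counts, for each term, the number of documents in which it occurs. */
    public static Map<String, Integer> documentFrequencies(List<Map<String, Double>> docs) {
        Map<String, Integer> df = new HashMap<>();
        for (Map<String, Double> doc : docs) {
            for (String term : doc.keySet()) {
                df.merge(term, 1, Integer::sum);
            }
        }
        return df;
    }

    /**
     * Scales each term frequency in doc by ln(numDocs / df).
     * Terms with no recorded document frequency get weight 0.
     */
    public static Map<String, Double> weight(Map<String, Double> doc,
                                             Map<String, Integer> df,
                                             int numDocs) {
        Map<String, Double> out = new HashMap<>();
        for (Map.Entry<String, Double> e : doc.entrySet()) {
            int count = df.getOrDefault(e.getKey(), 0);
            double idf = count == 0 ? 0.0 : Math.log((double) numDocs / count);
            out.put(e.getKey(), e.getValue() * idf);
        }
        return out;
    }
}
```

Under this formula, a term that appears in every document gets weight zero, and rarer terms are weighted up.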

Motivation

Transformer performance was far too slow in some cases.

Member

@Craigacp Craigacp left a comment

A few small things to fix.

@eelstretching
Member Author

Somehow I also managed to not add IDFTransformer to the original commit.

Member

@Craigacp Craigacp left a comment

LGTM

@Craigacp Craigacp merged commit 1f142d9 into oracle:main Dec 10, 2020