Use python tokenizer to identify single-line comments and docstrings. #106

ashanbrown · 2017-03-06T06:27:55Z

Perform the raw analysis using a single iterator.
Add test cases that cover line continuations and unusual docstrings.

coveralls · 2017-03-06T06:29:26Z

Coverage increased (+0.01%) to 98.281% when pulling 7c585d5 on ashanbrown:update-docstring-analysis into 52daa6c on rubik:master.

coveralls · 2017-03-06T06:31:54Z

Coverage increased (+0.01%) to 98.281% when pulling 79548fd on ashanbrown:update-docstring-analysis into 52daa6c on rubik:master.

ashanbrown · 2017-03-06T14:41:18Z

Ciao @rubik. I was having trouble running a raw analysis on a codebase, so I rewrote much of the comment-recognition code from an earlier PR. In particular, the code I was having a problem with looked like this:

def function():
    multiline_with_equals_in_it = """ """
    pass

The existing code tries to locate the '=' in multi-line string assignments, and it makes assumptions that do not always hold. I have replaced it with code that uses the Python tokenizer to recognize strings and comments instead of a custom parser. I've also tried to reduce the number of times radon parses the code to one.
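As a rough illustration of the tokenizer-based idea (not the code in this PR), the sketch below uses the standard-library `tokenize` module to report which lines hold comments and which line ranges are covered by multi-line strings. The helper name `find_comments_and_strings` and the `SAMPLE` source are hypothetical and exist only for this example.

```python
import io
import tokenize


def find_comments_and_strings(source):
    """Return (comment_lines, multiline_string_spans) for the given source.

    Hypothetical helper for illustration; it is not radon's implementation.
    """
    comment_lines = []
    string_spans = []
    readline = io.StringIO(source).readline
    for tok in tokenize.generate_tokens(readline):
        start_row, end_row = tok.start[0], tok.end[0]
        if tok.type == tokenize.COMMENT:
            # A comment token never spans more than one line.
            comment_lines.append(start_row)
        elif tok.type == tokenize.STRING and end_row > start_row:
            # A string token that ends on a later line is a multi-line string.
            string_spans.append((start_row, end_row))
    return comment_lines, string_spans


# Hypothetical input: a multi-line string assignment containing '='.
SAMPLE = (
    'def function():\n'
    '    text = """\n'
    '    a = b\n'
    '    """\n'
    '    pass  # trailing comment\n'
)

comments, strings = find_comments_and_strings(SAMPLE)
print(comments, strings)
```

Because the tokenizer already knows where each string starts and ends, the '=' inside the string body never has to be located or interpreted.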

I have only changed one existing test case, which appeared to be expecting the wrong number of blank lines. I'm assuming we should count lines as blank whether or not they are inside multi-line strings.

Please let me know what I can do to get this into the codebase. Thank you.

ashanbrown · 2017-03-06T14:59:31Z

radon/tests/test_raw.py

@@ -187,7 +187,7 @@ def fib(n):
         """
         if n <= 1: return 1  # otherwise it will melt the cpu
         return fib(n - 2) + fib(n - 1)
-     ''', (6, 9, 10, 2, 3, 1, 1)),
+     ''', (6, 9, 10, 2, 3, 2, 1)),


I believe that '2' is the correct value for the number of blank lines in this code. I think the blank line inside the multi-line string was previously excluded.
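To make the counting rule concrete, here is a minimal sketch that treats every whitespace-only line as blank, whether or not it falls inside a multi-line string. The function `count_blank_lines` and the `SAMPLE` source are hypothetical; the sample is not the test case from radon/tests/test_raw.py.

```python
def count_blank_lines(source):
    """Count lines that are empty or contain only whitespace."""
    return sum(1 for line in source.splitlines() if not line.strip())


# Hypothetical input: one blank line inside the docstring, one outside it.
SAMPLE = (
    'def fib(n):\n'
    '    """Return a Fibonacci number.\n'
    '\n'
    '    The blank line above sits inside the docstring.\n'
    '    """\n'
    '\n'
    '    return n\n'
)

blank = count_blank_lines(SAMPLE)
print(blank)
```

Under this rule both blank lines are counted, which matches the reasoning behind changing the expected value from 1 to 2.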

coveralls · 2017-03-06T15:06:52Z

Coverage increased (+0.01%) to 98.281% when pulling dd0eb9e on ashanbrown:update-docstring-analysis into 52daa6c on rubik:master.

coveralls · 2017-03-06T15:18:41Z

Coverage increased (+0.2%) to 98.424% when pulling 220ab6b on ashanbrown:update-docstring-analysis into 52daa6c on rubik:master.

coveralls · 2017-03-06T15:54:43Z

Coverage increased (+0.2%) to 98.424% when pulling 1305669 on ashanbrown:update-docstring-analysis into 52daa6c on rubik:master.

coveralls · 2017-03-07T18:34:22Z

Coverage increased (+0.2%) to 98.424% when pulling 5f05f1b on ashanbrown:update-docstring-analysis into 52daa6c on rubik:master.

rubik · 2017-03-07T19:50:39Z

Thanks Andrew! I'll review this tomorrow but at a cursory glance it looks good. The code you provided definitely crashes the analysis, so something will have to be done for sure.

rubik · 2017-03-14T09:56:33Z

Sorry for the delay Andrew! The PR looks good and I'm merging it. Thanks again!

ashanbrown · 2017-03-27T00:29:21Z

Thanks @rubik. Please let me know if any issues are reported with this change. I'll be happy to try to fix them.

ashanbrown force-pushed the update-docstring-analysis branch from 7c585d5 to 79548fd Compare March 6, 2017 06:30

ashanbrown force-pushed the update-docstring-analysis branch from 220ab6b to 1305669 Compare March 6, 2017 15:51

ashanbrown changed the title ~~Use AST to find single-line comments and docstrings.~~ Use python tokenizer to identify single-line comments and docstrings. Mar 7, 2017

Andrew S. Brown added 2 commits March 7, 2017 10:31

Use python tokenizer to identify single-line comments and docstrings.

4b7594e

Perform the raw analysis using a single iterator. Add test cases that cover line continuations and unusual docstrings.

Require py.test 2.7 or more.

5f05f1b

ashanbrown force-pushed the update-docstring-analysis branch from 1305669 to 5f05f1b Compare March 7, 2017 18:32

rubik merged commit a34f0b6 into rubik:master Mar 14, 2017

ashanbrown deleted the update-docstring-analysis branch March 27, 2017 00:29
