Enhancing Analyzer Accuracy for String Parsing and Special Characters Handling #51

ksg97031 · 2023-08-27T06:04:19Z

The current analyzer employs a fundamental string parsing logic (regular expressions, string splitting ..), which means that it is not guaranteed to be 100% accurate. This is because some characters, such as double quotes, can be interpreted as special characters by the analyzer.

For example, the following Python code:

from django.urls import path
from . import views

urlpatterns = [
    path("example\"'route", views.app2_index, name="index"), 
]

will not be parsed correctly by the analyzer because the double quotes(") are interpreted as part of the path variable.

Similarly, the following Go code:

e.GET("/pet,comma", func(c echo.Context) error {
    return c.String(http.StatusOK, "Hello, Pet!")
})

will also not be parsed correctly because the comma(,) is interpreted as a delimiter.

As this doesn't represent a universal scenario, I'm not sure whether to keep as a known issue or implement a shared lexer and parser to handle these cases more comprehensively.

hahwul · 2023-08-27T14:59:10Z

You're right. guaranteeing perfection is challenging due to our tool's reliance on regular expressions and string matching for analysis. Creating a Lexer/Parser would involve abstracting the code, considering each language's syntax, and identifying endpoints. This would necessitate significant changes to our current structure.

While I agree it's the right long-term direction, taking the first step is proving to be tough 😨

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enhancing Analyzer Accuracy for String Parsing and Special Characters Handling #51

Enhancing Analyzer Accuracy for String Parsing and Special Characters Handling #51

ksg97031 commented Aug 27, 2023 •

edited

hahwul commented Aug 27, 2023 •

edited

Enhancing Analyzer Accuracy for String Parsing and Special Characters Handling #51

Enhancing Analyzer Accuracy for String Parsing and Special Characters Handling #51

Comments

ksg97031 commented Aug 27, 2023 • edited

hahwul commented Aug 27, 2023 • edited

ksg97031 commented Aug 27, 2023 •

edited

hahwul commented Aug 27, 2023 •

edited