Python: Add SSRF queries #7420

RasmusWL · 2021-12-16T01:03:19Z

I've added 2 queries:

one that detects full SSRF, where an attacker can control the full URL, which is always bad
and one for partial SSRF, where an attacker can only control parts of an URL (such as the path, query parameters, or fragment), which is not a big problem in many cases (but could still be exploitable)

full SSRF should run by default, and partial SSRF should not (but having the query included makes it easy to run). I got inspired by this setup from Java where they have a precise and imprecise version of the same query.

Current status

Most of the query work/library modeling is done (although we could always add support for more libraries). Still need to do:

write qhelp
write change-note
Polish sanitizer for full SSRF query so we're able to detect "https://" + user_input is in fact controlling the full URL.
verify FP rates from run across many repos
verify performance looks ok
make some updates to the Ruby code so we're better aligned (but that can wait until after this PR is merged I think)

Commits

Some of the commits changes the concepts that was added in the very first commit. I've kept things this way so it could help to illustrate why I wanted to diverge from the Ruby code.

What is SSRF even?

See https://portswigger.net/web-security/ssrf if you need a refresher on SSRF 😊

The text was updated successfully, but these errors were encountered:

Taken from Ruby, except that `getURL` member predicate was changed to `getUrl` to keep consistency with the rest of our concepts, and stick to our naming convention.

For the snippet below, our current query is able to show _why_ we consider `var` to be a falsey value that would disable SSL/TLS verification. I'm not sure we're going to need the part that Ruby did, for being able to specify _where_ the verification was removed, but we'll see. ``` requests.get(url, verify=var) ```

Also adjusts test slightly. Writing `clientRequestDisablesCertValidation=False` to mean that certificate validation was disabled by the `False` expression is just confusing, as it easily reads as _certificate validate was NOT disabled_ :| The new one ties to each request that is being made, which seems like the right setup.

I think `getUrl` is a bit too misleading, since from the name, I would only ever expect ONE result for one request being made. `getAUrlPart` captures that there could be multiple results, and that they might not constitute a whole URl. Which is the same naming I used when I tried to model this a long time ago https://github.com/github/codeql/blob/a80860cdc6b06b363b0d0919600ab383a470b449/python/ql/lib/semmle/python/web/Http.qll#L102-L111

I've added 2 queries: - one that detects full SSRF, where an attacker can control the full URL, which is always bad - and one for partial SSRF, where an attacker can control parts of an URL (such as the path, query parameters, or fragment), which is not a big problem in many cases (but might still be exploitable) full SSRF should run by default, and partial SSRF should not (but makes it easy to see the other results). Some elements of the full SSRF queries needs a bit more polishing, like being able to detect `"https://" + user_input` is in fact controlling the full URL.

python/ql/test/library-tests/frameworks/requests/taint_test.py

Co-authored-by: yoff <[email protected]>

Since that might not be the same place where the vulnerable URL part is.

Now full-ssrf will only alert if **all** URL parts are fully user-controlled.

They were very misleading before, because a sanitizer that happened early, would remove taint from the rest of the cases by use-use flow :|

python/ql/lib/semmle/python/frameworks/Requests.qll

Accidentally committed :|

I included examples of both types in the qhelp of both queries, to provide context of what each of them actually are.

python/ql/test/query-tests/Security/CWE-918-ServerSideRequestForgery/full_partial_test.py

…orgery/full_partial_test.py

yoff

LGTM - thanks for the offline explanations

That was changed in 9866214

yoff

Lgtm

RasmusWL · 2021-12-17T15:20:35Z

RasmusWL added 12 commits Dec 13, 2021

Python: Add HTTP::Client::Request concept

5de79b4

Taken from Ruby, except that `getURL` member predicate was changed to `getUrl` to keep consistency with the rest of our concepts, and stick to our naming convention.

Python: Clearer sourceType for client response body

08f6d1a

Python: Add modeling of requests

b68d280

Python: Consider taint of client http requests

35cba17

Python: Model requests Responses

cf2ee06

Python: Add tests of http.client.HTTPResponse

a5bae30

Python: Add modeling of http.client.HTTPResponse

6f81685

Python: Remove getResponse and do manual taint steps

579de0c

RasmusWL requested a review from yoff Dec 16, 2021

github-actions bot added documentation Python labels Dec 16, 2021

yoff reviewed Dec 16, 2021

View changes

python/ql/test/library-tests/frameworks/requests/taint_test.py Outdated Show resolved Hide resolved

RasmusWL and others added 8 commits Dec 16, 2021

Python: Apply suggestions from code review

6ce1524

Co-authored-by: yoff <[email protected]>

Python: Minor adjustments to QLDoc of HTTP::Client::Request

5a7efd0

Python: Add interesting test-case

b1bca85

Python: Adjust SSRF location to request call

cb934e1

Since that might not be the same place where the vulnerable URL part is.

Python: Improve full/partial SSRF split

4b5599f

Now full-ssrf will only alert if **all** URL parts are fully user-controlled.

Python: Fix SSRF sanitizer tests

6f297f4

They were very misleading before, because a sanitizer that happened early, would remove taint from the rest of the cases by use-use flow :|

Python: Add tricky .format SSRF tests

8d9a797

Python: Allow http[s]:// prefix for SSRF

1d00730

intrigus-lgtm reviewed Dec 17, 2021

View changes

python/ql/lib/semmle/python/frameworks/Requests.qll Show resolved Hide resolved

RasmusWL added 2 commits Dec 17, 2021

Python: Remove debug predicate

e309d82

Accidentally committed :|

Python: Add SSRF change-note

e7abe43

RasmusWL requested a review from yoff Dec 17, 2021

Python: Add SSRF qhelp

83f1b2c

I included examples of both types in the qhelp of both queries, to provide context of what each of them actually are.

RasmusWL marked this pull request as ready for review Dec 17, 2021

RasmusWL requested a review from as a code owner Dec 17, 2021

yoff reviewed Dec 17, 2021

View changes

python/ql/test/query-tests/Security/CWE-918-ServerSideRequestForgery/full_partial_test.py Outdated Show resolved Hide resolved

yoff and others added 2 commits Dec 17, 2021

Update python/ql/test/query-tests/Security/CWE-918-ServerSideRequestF 1�7

9866214

…orgery/full_partial_test.py

Python: Fix typo

626009e

yoff reviewed Dec 17, 2021

View changes

Python: Adjust .expected based on new comment

83f87f0

That was changed in 9866214

RasmusWL dismissed yoff’s stale review via 83f87f0 Dec 17, 2021

RasmusWL requested a review from yoff Dec 17, 2021

yoff approved these changes Dec 17, 2021

View changes

codeql-ci merged commit 5054d5b into github:main Dec 17, 2021
15 checks passed

RasmusWL deleted the ssrf-new branch Dec 17, 2021

RasmusWL mentioned this pull request Jan 4, 2022

Python: Draft for SSRF query #2933

Closed

github / codeql Public

Python: Add SSRF queries #7420

Python: Add SSRF queries #7420

RasmusWL commented Dec 16, 2021 •

edited

yoff left a comment

yoff left a comment

RasmusWL commented Dec 17, 2021

github / codeql Public

Python: Add SSRF queries #7420

Python: Add SSRF queries #7420

Conversation

RasmusWL commented Dec 16, 2021 • edited

Current status

Commits

What is SSRF even?

yoff left a comment

yoff left a comment

RasmusWL commented Dec 17, 2021

RasmusWL commented Dec 16, 2021 •

edited