Why? Most humans (and we exclude all persons below 18) cannot hear the difference between recordings with AAC46 or AAC96 when it comes to speech. That means: No background noise/music, no nothing ... just speech (or moaning)
AAC96 is enough for background music and videos like "Star Wars: Clone Wars". I never tested it, but for high quality music AAC96 might not be enough. IMHO AAC64 equals MP3-128 a standard for music between 1996 and 2002.
Hach! Star Wars Clone Wars ...
A major difference with AAC48 is music and sounds like an AC.
Facts
38:36 (38,5) minutes with AAC64 vs AAC96
AAC64
Duration : 38 min 36 s
Stream size : 17.8 MiB (1%)
AAC96
Duration : 38 min 36 s
Stream size : 26.7 MiB (1%)... less than 10MB in 38,5 minutes.
AAC48 vs AAC64
AAC48
Audio
Format : AAC
Duration : 22 min 44 s
Bit rate mode : Constant
Bit rate : 48.0 kb/s
Maximum bit rate : 48.7 kb/s
Stream size : 7.91 MiB (1%)
AAC64
Audio
Format : AAC
Duration : 22 min 44 s
Bit rate mode : Constant
Bit rate : 66.2 kb/s
Maximum bit rate : 64.8 kb/s
Sampling rate : 48.0 kHz
Stream size : 10.5 MiB (1%)... less than 3MB in almost 23 minutes.
I downloaded vids from Noodlemagazine with 240p, but the sound was AAC128 or even AAC312 (rare).
For example:
https://www.avsubtitles.com/movie136/scooby-doo-a-xxx-parody-2011
You can download it with 240p and AAC126 - which is very high just for the speech. The movie has a lot of sound effects and music of course, but I bet you would not heare a difference in AAC96 (or even AAC64).

