This commit is contained in:
Elad Katz 2024-08-10 03:34:47 -04:00 committed by GitHub
commit ecc9e46c17
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194

View File

@ -1742,7 +1742,7 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"An image recognizer can, as its name suggests, only recognize images. But a lot of things can be represented as images, which means that an image recogniser can learn to complete many tasks.\n",
"An image recognizer can, as its name suggests, only recognize images. But a lot of things can be represented as images, which means that an image recognizer can learn to complete many tasks.\n",
"\n",
"For instance, a sound can be converted to a spectrogram, which is a chart that shows the amount of each frequency at each time in an audio file. Fast.ai student Ethan Sutin used this approach to easily beat the published accuracy of a state-of-the-art [environmental sound detection model](https://medium.com/@etown/great-results-on-audio-classification-with-fastai-library-ccaf906c5f52) using a dataset of 8,732 urban sounds. fastai's `show_batch` clearly shows how each different sound has a quite distinctive spectrogram, as you can see in <<img_spect>>."
]