... | @@ -133,3 +133,45 @@ db.focusSents.find().pretty() |
... | @@ -133,3 +133,45 @@ db.focusSents.find().pretty() |
|
```
|
|
```
|
|
|
|
|
|
you can check the documents/books and the images of the multimodal sentences respectively.
|
|
you can check the documents/books and the images of the multimodal sentences respectively.
|
|
|
|
|
|
|
|
---
|
|
|
|
|
|
|
|
For the further examples let's assume that the required parameters have the same value as their default values.
|
|
|
|
|
|
|
|
Let's say we want that the main images of the sentences have the same size (224x224 pixels) as the highlighted images. Then, we can resize the main images which are used and save them in "../data/resizedImages" with
|
|
|
|
|
|
|
|
```
|
|
|
|
python main.py --resized_main_images ../data/resizedImages
|
|
|
|
```
|
|
|
|
|
|
|
|
If we are only interested in the multimodal sentences. Then,
|
|
|
|
|
|
|
|
```
|
|
|
|
python main.py --multimodal_sents_only on
|
|
|
|
```
|
|
|
|
|
|
|
|
will filter out the non multimodal sentences.
|
|
|
|
|
|
|
|
Let's say we want to process at most 10 documents which have at most 5 sentences and they should be picked randomly
|
|
|
|
|
|
|
|
```
|
|
|
|
python main.py --max_docs 10 --max_sents 5 --rnd_seed 42
|
|
|
|
```
|
|
|
|
|
|
|
|
If we want to define a word from the concreteness values file to be concrete/depictable if it has a value of at least 50 we can do that with
|
|
|
|
|
|
|
|
```
|
|
|
|
python main.py --concreteness_threshold 50
|
|
|
|
```
|
|
|
|
|
|
|
|
The image retrieval with CLIP can be influenced with the parameters "--candidate_imgs", "--sent_img_similarity" and "--focus_word_img_similarity". The choice of the first two parameters is based on [paper](https://www.inf.uni-hamburg.de/en/inst/ab/lt/publications/2022-wangetal-lrec.pdf). The last parameter then bases on the second one. Especially, increasing the last two ones might result in more suitable images but less multimodal sentences
|
|
|
|
|
|
|
|
```
|
|
|
|
python main.py --candidate_imgs 10 --sent_img_similarity 2.5 --focus_word_img_similarity 2.75
|
|
|
|
```
|
|
|
|
|
|
|
|
The complex word identifier can be turned off. Then, every word is classified as complex
|
|
|
|
|
|
|
|
```
|
|
|
|
python main.py --cwi off
|
|
|
|
``` |
|
|
|
\ No newline at end of file |