A multi-modality deep learning model to detect hateful memes. The code was submitted as a solution to the task from Prof. Ponnurangam Kumaraguru's Precog Lab @ IIIT Hyderabad.
The Hateful Memes Challenge and Dataset were developed to advance research in identifying harmful multimodal content, particularly the challenging task of recognizing multimodal hate speech, which combines modalities such as text and images. This unique dataset, created by Facebook AI, consists of over 10,000 new multimodal examples and incorporates content licensed from Getty Images to provide a diverse, comprehensive resource for AI researchers.
To ensure a consistent environment, it is recommended to use conda for managing dependencies. Follow these steps to set up the environment using the provided `environment.yml` file:
- Clone the repository:
  ```bash
  git clone https://github.com/omm-prakash/MemesNet.git
  cd MemesNet
  ```
- Create the conda environment:
  ```bash
  conda env create -f environment.yml
  ```
- Activate the environment:
  ```bash
  conda activate memes
  ```
Download the dataset from the Hateful Memes Challenge and follow the steps below to organize your project structure:
- Move to the `data` directory from the project root:
  ```bash
  cd ./data
  ```
- Save the downloaded dataset file into the `data` directory.
- Unzip the dataset:
  ```bash
  cd ..
  unzip data/your-dataset.zip -d data/
  ```
Now the environment is set up, and the dataset is ready for use in the project.
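As a quick sanity check, the snippet below loads the training annotations and verifies that the referenced images resolve on disk. It assumes the standard Hateful Memes layout (a `train.jsonl` file and an `img/` folder under `data/hateful_memes/`); adjust the paths if your unpacked structure differs.

```python
import json
from pathlib import Path

root = Path("data/hateful_memes")

# Each line of train.jsonl is a JSON record with "id", "img", "text", "label".
with open(root / "train.jsonl") as f:
    samples = [json.loads(line) for line in f]

print(f"{len(samples)} training samples")

# Verify the first record's image path resolves under the dataset root.
first = samples[0]
print(first["text"], first["label"], (root / first["img"]).exists())
```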
To perform object detection on images using the YOLO (You Only Look Once) model, run the following command:
```bash
python object_detection.py \
  --directory ./data/hateful_memes/img/ \
  --file <path_to_image_file> \
  --pick_random <True/False> \
  --plot <True/False> \
  --config data/yolov3.cfg \
  --weight data/yolov3.weights \
  --classes data/coco.names
```
- `--directory`: Path to the folder containing all images.
- `--file`: Image file path (optional, if not using a random image).
- `--pick_random`: Select a random image for detection (default: `True`).
- `--plot`: Show the image with detected objects (default: `True`).
- `--config`: Path to the YOLO model configuration file (default: `data/yolov3.cfg`).
- `--weight`: Path to the YOLO model weights file (default: `data/yolov3.weights`).
- `--classes`: Path to the COCO class names file (default: `data/coco.names`).
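For reference, the sketch below shows one way such a detection pass can be implemented with OpenCV's DNN module and the same YOLOv3 files listed above. It is illustrative only: `object_detection.py` may differ in its details, and the image filename is a hypothetical placeholder.

```python
import cv2
import numpy as np

# Same default paths as documented above.
config = "data/yolov3.cfg"
weights = "data/yolov3.weights"
classes = open("data/coco.names").read().strip().split("\n")

net = cv2.dnn.readNetFromDarknet(config, weights)
output_layers = net.getUnconnectedOutLayersNames()

image = cv2.imread("data/hateful_memes/img/01235.png")  # hypothetical file name
blob = cv2.dnn.blobFromImage(image, 1 / 255.0, (416, 416), swapRB=True, crop=False)
net.setInput(blob)
outputs = net.forward(output_layers)

# Each detection row: [cx, cy, w, h, objectness, per-class scores...]
for output in outputs:
    for det in output:
        scores = det[5:]
        class_id = int(np.argmax(scores))
        confidence = float(scores[class_id])
        if confidence > 0.5:
            print(classes[class_id], round(confidence, 2))
```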
To extract text from images using OCR, use the following command:
```bash
python ocr.py \
  --directory ./data/hateful_memes/img/ \
  --file <path_to_image_file> \
  --draw_anno <True/False> \
  --pick_random <True/False> \
  --plot <True/False>
```
- `--directory`: Path to the folder containing all images.
- `--file`: Image file path (optional, if not using a random image).
- `--draw_anno`: Draw annotations on the image (default: `False`).
- `--pick_random`: Select a random image for OCR (default: `True`).
- `--plot`: Show the image with extracted text (default: `True`).
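As a rough illustration, the sketch below extracts meme text with pytesseract; the actual `ocr.py` may use a different OCR backend, and the image filename is again a hypothetical placeholder.

```python
import pytesseract
from PIL import Image

image = Image.open("data/hateful_memes/img/01235.png")  # hypothetical file name

# Extract the raw text overlaid on the meme.
text = pytesseract.image_to_string(image)
print(text.strip())

# Word-level boxes and confidences, useful for drawing annotations (--draw_anno).
data = pytesseract.image_to_data(image, output_type=pytesseract.Output.DICT)
for word, conf in zip(data["text"], data["conf"]):
    if word.strip():
        print(word, conf)
```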
To initiate the training process with a modified configuration in `config.yml`, execute the following command:

```bash
python train.py
```

Ensure that you have updated the necessary configuration in the `config.yml` file before starting training.
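If you prefer to script configuration changes rather than edit the file by hand, a minimal PyYAML sketch follows. The keys shown (`epochs`, `batch_size`) are hypothetical examples; use the keys actually present in the repository's `config.yml`.

```python
import yaml

# Load the existing training configuration.
with open("config.yml") as f:
    config = yaml.safe_load(f)

# Inspect the current settings before launching train.py.
print(yaml.dump(config, default_flow_style=False))

# Example override (hypothetical keys; match them to the real config.yml).
config["epochs"] = 10
config["batch_size"] = 32
with open("config.yml", "w") as f:
    yaml.dump(config, f, default_flow_style=False)
```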
The project has not been licensed yet.
The cross-modality encoder is adapted from LXMERT.
Please contact me at [email protected] for any queries related to the code.