Courses

Summer school: Search Engine Technology

The internet would be useless without search engines. Most of us use search engine technology every day: for a quick Google search, or to set the destination on a TomTom. Even Apple's Siri would not be possible without search engines.

Course Aim

In this course we will explore the world of search engines. You will learn how search engines work, what challenges they deal with, and how their performance can be measured. Even better: you will be guided in building, evaluating, and improving your own search engine on a real-world dataset.

Topics

Information Retrieval is the scientific discipline behind search tools. This CuriousU course provides the foundations of Information Retrieval by addressing concepts like indexing, matching, and relevance. The course discusses fundamental approaches to Information Retrieval, such as ranking, controlled versus uncontrolled terms, query by example, and relevance feedback. It discusses mathematical models of Information Retrieval such as Boolean retrieval, probabilistic retrieval, language models, logical models, Google's PageRank, and learning-to-rank. It also addresses applications of Information Retrieval like multimedia retrieval and, of course, web search engines.

Please note: we strongly advise you to bring your own laptop for this course.
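To give a feel for what indexing, matching, and Boolean retrieval mean in practice, here is a minimal sketch in Python. It is not course material: the three product descriptions and the function names build_index and boolean_and are made up for illustration, and real search engines tokenize text far more carefully than a whitespace split.

```python
from collections import defaultdict


def build_index(docs):
    """Build an inverted index: each lowercased term maps to the ids of the documents containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return index


def boolean_and(index, query):
    """Boolean AND retrieval: return the ids of documents that contain every query term."""
    terms = query.lower().split()
    if not terms:
        return set()
    # Start from the postings of the first term and intersect with the rest.
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result


# Hypothetical toy collection (not the course dataset).
docs = {
    1: "red cotton shirt",
    2: "blue cotton trousers",
    3: "red wool sweater",
}
index = build_index(docs)
print(sorted(boolean_and(index, "red cotton")))  # prints [1]
```

Boolean retrieval only decides whether a document matches. Ranking models such as the probabilistic and language models listed above additionally order the matching documents by estimated relevance.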

Instructional modes

During the practical sessions we will build a search engine for product data using Elasticsearch, an open source search library. Some programming knowledge is required; we will be working in Python, but if you have programmed in another language you should be fine.

Learning outcomes

  • Explain basic concepts of Search Engine Technology, such as indexing, matching, and relevance
  • Explain and apply different approaches to search including exact matching, ranking, query by example, and relevance feedback
  • Apply mathematical models of Information Retrieval (Boolean retrieval, probabilistic retrieval, language models, logical models, Google's PageRank, etc.)
  • Set up a Search Engine using open source software
  • Carry out an experimental evaluation of a Search Engine and compute evaluation measures
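As an illustration of the last outcome, the sketch below computes three common evaluation measures for a single query: precision at k, recall at k, and average precision. The ranking and the set of relevant documents are invented for the example, and the course may use other measures or tools.

```python
def precision_at_k(ranking, relevant, k):
    """Fraction of the top-k retrieved documents that are relevant."""
    return sum(1 for doc in ranking[:k] if doc in relevant) / k


def recall_at_k(ranking, relevant, k):
    """Fraction of all relevant documents that appear in the top k."""
    if not relevant:
        return 0.0
    return sum(1 for doc in ranking[:k] if doc in relevant) / len(relevant)


def average_precision(ranking, relevant):
    """Mean of the precision values at each rank where a relevant document is found,
    divided by the total number of relevant documents."""
    if not relevant:
        return 0.0
    hits = 0
    total = 0.0
    for rank, doc in enumerate(ranking, start=1):
        if doc in relevant:
            hits += 1
            total += hits / rank
    return total / len(relevant)


# Hypothetical system output and relevance judgements for one query.
ranking = ["d3", "d1", "d7", "d2"]
relevant = {"d1", "d2", "d5"}

print(precision_at_k(ranking, relevant, 2))  # 0.5
print(recall_at_k(ranking, relevant, 4))
print(average_precision(ranking, relevant))
```

Averaging average_precision over a set of queries gives mean average precision (MAP), a single number that can be used to compare two versions of a search engine.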

Course schedule


CuriousU: Search Engine Technology

Jogging around campus
Time:15:00 - 16:00
Day:Sun. 13-08

Want to do some sports and see some cool locations on campus? During this activity you'll do both!

Jogging around campus
Time:17:00 - 18:00
Day:Sun. 13-08

Want to do some sports and see some cool locations on campus? During this activity you'll do both!

Sightseeing around campus
Time:16:00 - 17:00
Day:Sun. 13-08

Don't want to do sports, but do want to see some cool locations on campus? During this activity you'll walk around campus with a tour guide.

Sightseeing around campus
Time:21:00 - 22:00
Day:Sun. 13-08

Don't want to do sports, but do want to see some cool locations on campus? During this activity you'll walk around campus with a tour guide.

Arrival day: check in
Time:10:00 - 18:00
Day:Sun. 13-08

The registration point will be in the biggest tent. Please register as soon as you arrive. If you arrive earlier or later, please contact us via curious@utwente.nl.

Explore the campus
Time:20:00 - 22:00
Day:Sun. 13-08

Explore the campus with your new friends.

Meet-up and pizza
Time:18:00 - 20:00
Day:Sun. 13-08

Eat pizza and meet up with your fellow students!

Lunch
Time:12:30 - 14:00
Day:Mon. 14-08

After the first course sessions of the day you’ll join the other festival-goers for a themed lunch in an informal, interactive setting.

Free
Time:17:00 - 18:00
Day:Mon. 14-08

This hour is 100% free time: relax or use our sport facilities :-)

Breakfast & keynote: Ed Brinksma
Time:08:15 - 09:30
Day:Mon. 14-08

Prof.dr. H. (Ed) Brinksma, our former university dean, will kick off CuriousU with the first inspirational speech!

Lean simulation game
Time:20:00 - 23:00
Day:Mon. 14-08

Have fun and enjoy our evening programme while playing a lean simulation game.

Dinner: buffet
Time:18:00 - 20:00
Day:Mon. 14-08

Every night we serve you a fantastic dinner.

Intro to search
Time:09:30 - 12:30
Day:Mon. 14-08

Elasticsearch 101
Time:14:00 - 17:00
Day:Mon. 14-08

After lunch you’ll dive back into your course.

Dinner: buffet
Time:18:00 - 20:00
Day:Tu. 15-08

Every night we serve you a fantastic dinner.

Breakfast & keynote: J.E. Ramirez-Marquez
Time:08:15 - 09:30
Day:Tu. 15-08

Dr. Ramirez-Marquez has been a researcher all his life, with an interest in developing computational techniques to solve a variety of problems. His research currently focuses on the reliability analysis and optimization of complex systems, the development of mathematical models for the operational effectiveness of sensor networks, the computational analysis of resilience, and the development of evolutionary optimization algorithms. He will give a talk about future trends in smart city research.

Free
Time:17:00 - 18:00
Day:Tu. 15-08

This hour is 100% free time: relax or use our sport facilities :-)

Escape room & silent disco
Time:20:00 - 23:00
Day:Tu. 15-08

During this evening you can try to get out of an escape room. We will also have a silent disco. Do not forget to bring your headphones :)

Lunch: blind tasting & nitrogen ice cream
Time:12:30 - 14:00
Day:Tu. 15-08

After the first course sessions of the day you’ll join the other festival-goers for a themed lunch in an informal, interactive setting.

During this lunch you can do a blind tasting and eat some nitrogen ice cream. There will also be piñatas and some sticks to smash them!

Indexing
Time:09:30 - 12:30
Day:Tu. 15-08

My first search engine
Time:14:00 - 17:00
Day:Tu. 15-08

After lunch you’ll dive back into your course.

A lab search experiment
Time:14:00 - 17:00
Day:Wed. 16-08

After lunch you’ll dive back into your course.

Beach volleyball party
Time:20:00 - 23:00
Day:Wed. 16-08

Play some beach volleyball and dance to the music of Odem Ska and Reggae Band!

Lunch: happy as a child
Time:12:30 - 14:00
Day:Wed. 16-08

After the first course sessions of the day you’ll join the other festival-goers for a themed lunch in an informal, interactive setting.

During this lunch you can be a child again and spray each other with water pistols!

Free
Time:17:00 - 18:00
Day:Wed. 16-08

This hour is 100% free time: relax or use our sport facilities :-)

Breakfast & keynote: Bram Nauta
Time:08:15 - 09:30
Day:Wed. 16-08

Bram Nauta is a professor in Electrical Engineering at the UT and he can tell you everything about chips and circuits. He invented the Nauta Circuit. This invention came to his mind while he was taking a swim. He quickly went out of the pool and asked for pen and paper. Nauta will talk about his invention.

Product search
Time:09:30 - 12:30
Day:Wed. 16-08

Dinner: BBQ
Time:18:00 - 20:00
Day:Wed. 16-08

Bring along your flip-flops, because you'll have an amazing barbecue this evening.

Search evaluation
Time:09:30 - 12:30
Day:Th. 17-08

Dinner: Happy Italy
Time:18:00 - 20:00
Day:Th. 17-08

We go out for dinner in the city, at Happy Italy.

Measuring search quality
Time:14:00 - 17:00
Day:Th. 17-08

After lunch you’ll dive back into your course.

Grolsch Summer Sounds
Time:20:00 - 23:00
Day:Th. 17-08

Enjoy the night in the city listening to some musical entertainment.

Lunch: C- your future
Time:12:30 - 14:00
Day:Th. 17-08

After the first course sessions of the day you’ll join the other festival-goers for a themed lunch in an informal, interactive setting.

During this lunch you'll be introduced to the courses that are taught at the University of Twente.

Free
Time:17:00 - 18:00
Day:Th. 17-08

This hour is 100% free time: relax or use our sport facilities :-)

Breakfast & keynote: Parya Pasha Zadeh Monajjemi
Time:08:15 - 09:30
Day:Th. 17-08

Get your day off to an inspiring start by listening to Parya Pasha Zadeh Monajjemi over breakfast.


She is a Geoinformatics engineer specialized in spatial data management and GIS analysis with a keen interest in spatio-temporal analysis of movement data for understanding human behavior in the context of modern cities.

Models of search
Time:09:30 - 12:30
Day:Fri. 18-08

Dinner: buffet
Time:18:00 - 20:00
Day:Fri. 18-08

Every night we serve you a fantastic dinner.

Search models at work
Time:14:00 - 17:00
Day:Fri. 18-08

After lunch you’ll dive back into your course.

Band night
Time:20:00 - 23:00
Day:Fri. 18-08

Baby blue is returning to CuriousU! Last year it was a big success.

Lunch: Dutch olympics
Time:12:30 - 14:00
Day:Fri. 18-08

After the first course sessions of the day you’ll join the other festival-goers for a themed lunch in an informal, interactive setting.

During this lunch you'll play some typical Dutch games.

Free
Time:17:00 - 18:00
Day:Fri. 18-08

This hour is 100% free time: relax or use our sport facilities :-)

Breakfast & keynote: Lee Vinsel
Time:08:15 - 09:30
Day:Fri. 18-08

Lee Vinsel is an Assistant Professor of Science and Technology Studies at the Stevens Institute of Technology. He studied Philosophy and History of Technology. He will give a talk about maintenance and innovation in our current society. Capitalism excels at innovation but is failing at maintenance, and for most lives it is maintenance that matters more.

Breakfast
Time:08:15 - 09:30
Day:Sat. 19-08

Have breakfast first.

Free day / city trip etc.
Time:09:30 - 20:00
Day:Sat. 19-08

This is your day off. You can, for example, take a city trip to Amsterdam (The Netherlands), Rotterdam (The Netherlands), Münster (Germany) or Osnabrück (Germany). Have fun!

Pubquiz
Time:20:00 - 23:00
Day:Sat. 19-08

We will host a cool pubquiz.

Dinner
Time:18:00 - 20:00
Day:Sun. 20-08

Every night we serve you a fantastic dinner.

Tune your search engine
Time:14:00 - 17:00
Day:Sun. 20-08

After lunch you’ll dive back into your course.

Breakfast & keynote: Cristina Zaga
Time:08:15 - 09:30
Day:Sun. 20-08

Get your day off to an inspiring start by listening to Cristina Zaga over breakfast. She is a researcher in the Human Media Interaction research group. Her research revolves around child-robot cooperative (pro-social) interactions during collaborative games/tasks.

Movie night
Time:20:00 - 23:00
Day:Sun. 20-08

We'll watch a cool movie together!

Lunch: dancing
Time:12:30 - 14:00
Day:Sun. 20-08

After the first course sessions of the day you’ll join the other festival-goers for a themed lunch in an informal, interactive setting.

During this lunch you can join a dancing workshop.

Free
Time:17:00 - 18:00
Day:Sun. 20-08

This hour is 100% free time: relax or use our sport facilities :-)

How to build Google
Time:09:30 - 12:30
Day:Sun. 20-08

Dinner
Time:18:00 - 20:00
Day:Mon. 21-08

Every night we serve you a fantastic dinner.

Present your search engine
Time:14:00 - 17:00
Day:Mon. 21-08

After lunch you’ll dive back into your course.

Closing ceremony
Time:20:00 - 23:00
Day:Mon. 21-08

This night we'll close off CuriousU 2017 with everyone and you'll receive your certificate.

Lunch: labtour
Time:12:30 - 14:00
Day:Mon. 21-08

After the first course sessions of the day you’ll join the other festival-goers for a themed lunch in an informal, interactive setting.

During this lunch you can take a tour of all the labs on campus.

Free
Time:17:00 - 18:00
Day:Mon. 21-08

This hour is 100% free time: relax or use our sport facilities :-)

Breakfast & keynote: Llewelyn van Zyl
Time:08:15 - 09:30
Day:Mon. 21-08

Get your day off to an inspiring start by listening to Llewelyn van Zyl over breakfast. He is an industrial psychologist with a passion for online positive psychological interventions. Van Zyl is at the top of his profession, working as a specialist and consultant, and is currently serving as president of the Society for Industrial and Organisational Psychology of South Africa (SIOPSA).

Future search challenges
Time:09:30 - 12:30
Day:Mon. 21-08

Breakfast
Time:08:15 - 09:00
Day:Tu. 22-08

Have breakfast first.

Check-out
Time:09:00 - 10:00
Day:Tu. 22-08

The last day: check out (i.e. return your keys) and say goodbye to all your new friends. By returning your key you will get back your deposit for the log cabin or student dorm.

Schedule (text version)

Day 1

  • Lecture (1h.): Welcome & Introduction to Search
  • Discussion (0.5h.): Introduce yourself + personal learning objectives
  • Exercise (1.5h.): Git version management + Get the MyDatafactory product data
  • Lunch break
  • Tutorial (1h.): Elasticsearch & RESTful web services
  • Exercise (2h.): Install and run Elasticsearch

Day 2

  • Lecture (1.5h.): Indexing, a conceptual viewpoint
  • Exercise (1.5h.): Analyze matching problems in the data
  • Lunch break
  • Tutorial (1h.): Programming Elastic with Python
  • Exercise (1.5h.): Index the collection
  • Discussion (0.5h.): Discuss matching problems
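To make "Index the collection" a little more concrete: Elasticsearch accepts many documents at once through its bulk endpoint, which expects newline-delimited JSON in which an action line precedes each document. The sketch below only builds such a request body with the Python standard library. The index name "products", the helper bulk_payload, and the two product records are assumptions made up for this example and are not the MyDatafactory data; depending on the Elasticsearch version, extra metadata (such as a document type) may be required.

```python
import json


def bulk_payload(products, index_name="products"):
    """Build a newline-delimited JSON body for the Elasticsearch bulk endpoint.

    Each document is preceded by an "index" action line naming the target
    index and the document id. The body must end with a newline.
    """
    lines = []
    for product in products:
        action = {"index": {"_index": index_name, "_id": product["id"]}}
        lines.append(json.dumps(action))
        lines.append(json.dumps(product))
    return "\n".join(lines) + "\n"


# Hypothetical product records, for illustration only.
products = [
    {"id": "p1", "title": "Hex bolt M8 x 40", "brand": "Acme"},
    {"id": "p2", "title": "Hex nut M8", "brand": "Acme"},
]

payload = bulk_payload(products)
print(payload)
```

The resulting text could be sent to a running Elasticsearch instance as the body of a POST request to its _bulk endpoint with the content type application/x-ndjson. In the practical sessions a Python client library may take care of this step instead.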

Day 3

  • Lecture (1.5h.): Evaluation of search engines & Empirical Research
  • Exercise (1.5h.): Create a "run", an experiment file
  • Lunch break
  • Tutorial (1h.): Elasticsearch: nuts & bolts
  • Exercise (1.5h.): Calculate evaluation metrics
  • Discussion (0.5h.): Discuss the baseline evaluation results

Day 4

  • Lecture (1.5h.): Models of Information Retrieval
  • Exercise (0.5h.): The QUIZ
  • Exercise (1h.): Improve your system using different models
  • Lunch break
  • Tutorial (1h.): Elasticsearch: nuts & bolts
  • Exercise (1.5h.): Improve your system using different indexers
  • Discussion (0.5h.): Discuss improvements over the baseline

Day 5

  • Lecture (1.5h.): Machine Learning (ML) for search engines
  • Exercise (1.5h.): Generate pair-wise training data for ML
  • Lunch break
  • Tutorial (1h.): Rank SVM
  • Exercise (1.5h.): Improve your system using ML
  • Discussion (0.5h.): Discuss improvements from ML
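As a rough idea of what "pair-wise training data" can look like, the sketch below follows the usual pairwise formulation behind Rank SVM: for two candidate documents of the same query with different relevance grades, the training example is the difference of their feature vectors, labelled +1 if the first document is the more relevant one and -1 otherwise. The feature values, the grades, and the function name pairwise_examples are invented for illustration; the exercise in the course may be set up differently.

```python
from itertools import combinations


def pairwise_examples(candidates):
    """Turn graded candidates for ONE query into pairwise training examples.

    candidates: list of (feature_vector, relevance_grade) tuples.
    Returns a list of (difference_vector, label) tuples, where label is +1
    if the first document of the pair is more relevant and -1 otherwise.
    Pairs with equal grades carry no preference and are skipped.
    """
    examples = []
    for (features_a, grade_a), (features_b, grade_b) in combinations(candidates, 2):
        if grade_a == grade_b:
            continue
        difference = [a - b for a, b in zip(features_a, features_b)]
        label = 1 if grade_a > grade_b else -1
        examples.append((difference, label))
    return examples


# Hypothetical candidates for one query: ([feature 1, feature 2], relevance grade).
candidates = [
    ([2.0, 1.0], 1),
    ([1.0, 3.0], 0),
    ([0.0, 0.0], 0),
]

for difference, label in pairwise_examples(candidates):
    print(difference, label)
```

A linear classifier trained on such difference vectors learns a weight vector that can then be used to score and rank individual documents.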

Day 6

  • Lecture (1h.): How to build Google in 1.5h?
  • Exercise (1h.): Estimate Google's index size and search speed
  • Discussion (1h.): Your plan to improve product search
  • Lunch break
  • Tutorial (1h.): Elasticsearch for experts
  • Exercise (2h.): Improve your system

Day 7

  • Lecture (1h.): Future challenges of search
  • Exercise (2h.): Work on your search engine
  • Lunch break
  • Exercise (1.5h.): Finalize work and prepare a presentation
  • Lecture (1.5h.): Student presentations: present your system


Course leaders

Djoerd Hiemstra

Djoerd Hiemstra is associate professor in database and search engine technology at the University of Twente. Djoerd also heads Searsia, a UTwente spin-off that provides open source federated search technology.

Visit Djoerd's website for more information about his research.

Dolf Trieschnigg

Dolf Trieschnigg is a data scientist at Mydatafactory, a company specialised in cleansing and matching product data. He is also a guest lecturer at the University of Twente. His research interests include information retrieval, information extraction and natural language processing.

Visit Dolf's website for more information about his research.

Want to know more?

If you need to know more, we have information available about the fee & programme, an admission check, registration & payment, visa, accommodation and the terms & conditions.