Personal Blog Week 13

This week while I am producing more XML files for the five directors, I am trying to design a mechanism to showcase how flexible and transformable XML files could be. My first trial is about XML-XSLT-HTML, which means I use XSLT (Extensible Stylesheet Language Transformations) to transfer my XML code into HTML for web content.

XML: https://drive.google.com/file/d/1GrrzZob4bq5U9sFbLRZ1OTTojxi2N2BY/view?usp=share_link

XSLT: https://drive.google.com/file/d/1V97Dj4S4TNtniGOmGAp3WrxePm-GtIZi/view?usp=share_link

And here is the screenshot of my result.

XML-XSLT-HTML

 

As you can see in my XML file (upper left), I have adjusted the structure in XSLT five times to get the current result (button left). So then, I will ask members to revisit their structures and give me the files. I will guide everybody through the DTD and XSLT tomorrow; ideally, we will get three pilot samples that we could show on our website. A total of 15 XML files will be saved in our shared folder. We decided to do 3 pilot example HTML pages, and instead of only showing the final result, we will tell the behind-the-scenes story with screenshots or screen recordings of our codes. The time probably wouldn’t allow us to do all 15 in this XML-XSLT-HTML format, but if anyone follows our workflow, they could get an HTML code by spending some time and effort.

Designing our DTDs and XSLT is difficult because we encoded our files individually. It is just so hard to do collective encoding this semester due to time limits and course structure. We don’t have a mechanism like the Orlando project, so we tried our best to give pilots and show our workflow.

You might notice that we use different tags, and even for one director, for different films, some tags appear, and some do not. (Because for some films, we just couldn’t find that much information). So, in this case, we need to give a conditional code in XSLT as xsl: if to make sure there is no empty content in our display. This if logic also works in our DTDs in which a question mark appears after the element like this “<!ELEMENT div2 (title, movie_info, based_on?)>”, indicating that the element is optional (it can be present once or not at all).

So to give a bigger picture, I will mention that we collect the data, choose annotation tools as pointers, and write the XML code in my presentation. While we give an example of XML-XSLT-HTML, there are also possible options for JSON, CSV, SQL database, or mobile applications. And you may wonder about the relationship between XML and TEI; this is my work this week before the final delivery to explain the connection between XML-TEI. Actually, the Orlando team started with the SGML and transferred their codes to the TEI structure, which I guess is for publication purposes.

Something you might read further:  (as a quick answer to why we are building XML files on women directors by our scholarly annotations instead of scraping data directly from the internet)

I tested a way to retrieve XML information of Kayo Hatta using Google Colab from IMDB:

https://colab.research.google.com/drive/1ltiKClrJCG9-MSmFXA6fLKgkO0dTMpki?usp=share_link

Look at the result and how limited and short it is:

IMDB XML file

Week 13—MTS personal update

It’s been a great week. There are all the loose ends to sort of tie together, but they are being tied. We had a great meeting on Saturday and settled on key dates to wrap things up and start testing, and it all feels doable. Knowing me, I’ll throw in some extra tasks (still more activists who could be written up and added to this iteration!!), but I’ll try not to be unreasonable. 

Last Saturday, Estefany and I attended the 11th Annual Columbia University Pow wow. Estefany met my kids, and I got to share a little pan-Indian culture with her. We put in a valiant effort to win the potato dance along side my sons, and we danced a twisty-turny two-step. Although there was nothing specific to the Wampanoag on display, going to pow wows always bring strong feelings of taking in strong medicine. The drumbeat always heals me and centers me, and I enjoy the flashbacks of spending summers on the pow wow circuit with my mother.  

Before the break I was able to correspond with Professor George Price a bit, and I’m looking forward to getting more of his input on the work as a whole. We started an interesting discussion about the threat of kidnapping faced by Wampanoag of mixed heritage who didn’t live within native communities. He also pointed me to a passage in his book, The Eastons: Five Generations of Human Rights Activism, 1748-1935 (about a prominent and exceptional mixed family of Wampanoag heritage from whom he depends), in which Hosea Easton considers the different attitudes that whites had towards Indians and Afro-Americans. In it H. Easton offers that whites had gotten what they wanted from the native community: land, and so they no longer considered them as a threat or even really existing, despite evidence to the contrary. On the other hand, it was their very bodies and labor of Afro-Americans that whites wanted and that was a perpetual location for exploitation holding a continued emotional/political/economic charge. This has me thinking about how the Wampanoag, mixed and otherwise, denied this role as innocuous or irrelevant subdued wards, and made their presence and intentions known through, among other things, their activism. 

Week 13_Ob&Up-date: inching speedily towards “done”

STATUS & SCOPE

Someone (Paul Valery maybe) said: A poem is never finished, only abandoned. Ob&Up is the poem and will have to be (temporarily) abandoned soon — best to do it at a rewarding juncture. The point of temporary abandonment we aim for (i.e., our definition of “done”): a functional website with an operational Bingo game. The website also features content/narrative that details and contextualizes our findings, includes team info, and a link to an accessible corpus on Git Hub. 

The scope of the project is becoming more apparent. As expected, we are finding more questions to pursue in text analysis. The trajectory of text analysis has always been unpredictable. Based on previous experiences working on artistic and literary analysis projects, I know that once analysis begins, deeper layers of analysis present themselves. So at this stage it is less about scaling down an initially too-ambitious scope and more about tempering ever more burgeoning ideas for new queries. 

TEXT ANALYSIS

With that in mind, Teddy and Maria will use this week (until 4/26) to pursue a final round of additional text analysis. Teddy will predominantly work with the full corpora for both seasons. Maria will mostly work with the judging segments of our anchor corpus, season 12. We plan to document our findings with an eye on creating copy for the site and constructing our larger argument.

Next week (until 5/3): We will combine all final text analysis findings to finish the website copy, and Maria will also be revisiting the presentation slides. Teddy will focus on making the corpora available via Git Hub.

BINGO and WEBSITE

Nuraly has finished coding the bingo game’s “outline.”  Now we have to create custom tiles for the 25 fields of the game card. Each tile will feature one typical GBBS judging phrase. Maria has passed the chosen phrases on to RC, and this week, RC will create the tiles and pass them on to Nuraly, who will add them to the bingo game. Nuraly and RC will continue working together to ensure that the bingo plug-in functions as intended on the website. (The bingo’s functionality can and will be tested before the custom tiles are added.)Since we have decided to move the site off the limited Commons version and upgrade to a paid version, RC will continue to work on the site’s infrastructure. There are plenty of technical aspects associated with the upgrade for RC to wrangle, and she will continue to do that this week. 

Next week: RC and Maria will collaborate to implement narrative website content, and Nuraly will begin to reach out to the GBBS Redditors. We might ask them for feedback and input on the game as soon as we can.

Overall, it feels as though we have a grasp on what needs to be done before the launch, and the task list looks (a little) more achievable than daunting.

 

Week 13_Ob&Up-date: inching speedily toward “done”

STATUS & SCOPE

Someone (Paul Valery maybe) said: A poem is never finished, only abandoned. Ob&Up is the poem and will have to be (temporarily) abandoned soon — best to do it at a rewarding juncture. The point of temporary abandonment we aim for (i.e., our definition of “done” ): a functional website with an operational Bingo game. The website also features content/narrative that details and contextualizes our findings, includes team info, and a link to an accessible corpus on Git Hub. 

The scope of the project is becoming more apparent. As expected, we are finding more questions to pursue in text analysis. The trajectory of text analysis has always been unpredictable. Based on previous experiences working on artistic and literary analysis projects, I know that once analysis begins, deeper layers of analysis keep presenting themselves. So at this stage it is less about scaling down an initially too-ambitious scope and more about tempering ever more burgeoning ideas for queries. 

TEXT ANALYSIS

With that in mind, Teddy and Maria will use this week (until 4/26) to pursue a final round of additional text analysis. Teddy will predominantly work with the full corpora for both seasons. Maria will mostly work with the judging segments of our anchor corpus, season 12. We plan to document our findings with an eye on creating copy for the site and constructing our larger argument.

Next week (until 5/3): We will combine all final text analysis findings to finish the website copy, and Maria will also be revisiting the presentation slides. Teddy will focus on making the corpora available via Git Hub.

BINGO and WEBSITE

Nuraly has finished coding the bingo game’s “outline.”  Now we have to create custom tiles for the 25 fields of the game card. Each tile will feature one typical GBBS judging phrase. Maria has passed the chosen phrases on to RC, and this week, RC will create the tiles and pass them on to Nuraly, who will add them to the bingo game. Nuraly and RC will continue working together to ensure that the bingo plug-in functions as intended on the website. (The bingo’s functionality can and will be tested before the custom tiles are added.)Since we have decided to move the site off the limited Commons version and upgrade to a paid version, RC will continue to work on the site’s infrastructure. There are plenty of technical aspects associated with the upgrade for RC to wrangle, and she will continue to do that this week. 

Next week: RC and Maria will collaborate to implement narrative website content, and Nuraly will begin to reach out to the GBBS Redditors. We might ask them for feedback and input on the game as soon as we can.

Overall, it feels as though we have a grasp on what needs to be done before the launch, and the task list looks (a little) more achievable than daunting. 

 

DirectHERS – Week 13 Group Update

We are sprinting toward the final outcome. The research group aims to finish all XML files by following Monday with any DTD files associated with them. We are a bit delayed in making the XML files because some structure issues are pending solutions, which is essential to the search functionality. During the break, we finished the draft of our About page, and each member met with Gemma individually to discuss their personal page info. We decided to give up words like underrepresented, and marginalized and use groundbreaking. We wrote that the team explores the incredibly diverse, worldwide voices of women film directors and focuses light on their work in a digitally encoded collection.

“The fifteenth groundbreaking women directors we choose to work on, representing diverse cultural and ethnic backgrounds, have made significant contributions to cinema by shedding light on unique non-mainstream perspectives and critical social issues. There is a worldwide need to lessen discrimination and respect non-mainstream experiences, identities, and knowledge to broaden our societal goal of achieving fundamental human rights.”

Some of us are also interesting in drawing connections and visualizing the similitudes between the films and the careers of these women. For example, what does the cinema produced by Lulu Wang, Chinese American director, have to do with the cinema of Iranian filmmaker Rakhshān Banietemad? Portraying those connections could be another way to investigate and answer: Why/how is our project’s methodology feminist? But also, Are the works of each director contributing to a feminist cinema?

With those questions in mind, we figured we could include a Tag Network in our website visualizing each director’s structure. The Tag Network can help the user see the elements we used to encode and draw connections between their profiles, their films, interests and works. We’ve come up with a few mockups on how it could look like, which are included below. If you have time, we would appreciate any feedback or ideas on their design!

Finally, starting next week, the research team will move on to write their lab notes on tags and XML encoding. Maria and Miaoling will figure out a video plan to visually show our work in less than 1 minute. Gemma will move the tag glossary to the website into three columns that have been designed in our mockup.

Tag Networks:

Network Tag - Draft A. Shows the tags used to encode one of the director profile.

MTS—Week 13 Team Update

We’ve streamlined, fine-tuned and realigned and we’re now a lean-mean-closing machine. We’ve cut back on unessential functionality and more expansive content, and have opted for a pared down but still highly engaging and impactful first iteration of the project. The project, via visual design and content curation, has a tight focus that drives home the key issue areas that concerned Wampanoag activists and how they fit into the larger landscape of activity during the era. Color coding and UX guidance to explanatory copy equips visitors with the context needed to walk away with, hopefully, a distance impression of our intentions.

DESIGN

Estefany is working on generating launch posts for our social channels based on templates that highlight key aspects of the project content. She is pulling copy from the research documents to account for captions and longer format expressions. 

Next steps:

  • Design highlights
  • Start posting 

 

TECH

Zelda is creating CSS stylesheets and front end layouts based on finalized wireframes. Elizabeth is styling the timeline and populating entries. Connections between the database and functional elements (map/timeline) are ongoing. Our goal is to be fine-tuning a fully functioning site by the end of next week.  

Next steps:

  • Create up-to-date JSON with the live database
  • Purchase platform space and activate freestanding database after troubleshooting local expression

 

OUTREACH

Estefany has already starting building our following (almost at 30 as of posting). She’s branded our social accounts and started following key accounts and spreading awareness by word of mouth. Some team members are planning on attending the Columbia University Powwow this weekend to enjoy the event and potentially make additional connections.

 

Next steps:

  • Launch social next week
  • Emails to key folks once the website is launched.
  • Continued social audience building

RESEARCH + COPYWRITING

Research of the short list of core activists and events is entered into the database with relevant copy, images, and mapping coordinates. Time permitting we will add more—but the priority now is to get a functioning project that demonstrates our concept. 

Next steps:

  • Majel and Elizabeth  to wrap up the remaining site copy (about us page, timeline specific blurbs) and assist with outreach. 
  • Updates to the project presentation deck 

 

GS: Journal Entry (9/12), 19th April

Dear Journal,

First and foremost, I hope everyone had a fantastic break! Our team managed to get some ‘breathing space’ as everyone had the chance to either travel or see their beloved ones – or both! Nevertheless, we all kept working with alacrity on the project.

The research team provided additional directors’ bios and further contents to upload to our website, and compiled more XML files which are in the process of being reviewed for the search engine functionality. Personally, I have connected with almost everyone during the break to chat about how to manage online contents and what to display in our final webpage.

The list of tags has grown exponentially, which was somehow expected considering how difficult it is to map one’s life (especially someone we do not personally know) based on what is available online.  Work is in progress to consolidate the list we currently have and related definitions.

It is noticeable how the project has shifted from a relatively niche subset of filmmakers to a much broader pool of artists – I believe the more we explore the more we find unrecognized talents (particularly women) across a vast range of regions. In this sense, we are attempting to find an easy way to present our efforts during the final presentation as we are in agreement on how difficult it might be for someone outside our project to get a clear grasp of the purpose of our ‘datification’ mission.

 

MTS Week 13—Rounding the Bend

It feels like the course end is suddenly closing in fast. We’ve done so much in the past weeks, and most of the team even anticipated trucking along through the break—but, surprise, people needed an actual break. The amount that has been done so far is so impressive and we have so much to be proud of—at this point we just need to tie it all up into a shareable experience so people can really see and feel that effort. 

“Just” I say— but, yet it is a “just.” Just be honest with ourselves about what is doable, where we are, and where we can get. I feel positive about what we can still accomplish, but I am looking forward to sharing space with the team so we can get aligned and feel each others energy as we start to bring the project to shareable life. Everyone has been asked to take a look at their own work streams and consider places they can simplify and consolidate so we can discuss what places to streamline while getting us as close to our vision as we can. 

As for me personally, before the break, in the days just before the break I closed out the research portion of the project and briefly shifted my focus to my other classes to make sure everything was in a good position for my return (I was on a family trip that made it impossible to work). Now that we are back, I am devoting as much time as I can to writing the remaining front end copy (finished the glossary today!—anticipate the front end copy done this week), and supporting on outreach (specifically social content). These are all within my wheelhouse, so I currently feel good about getting myself in the position to finetune our presentation materials in a timely manner—family and other courses-permitting.     

Week 10_Past Midpoint Summary

We’ve managed to make it to our midpoint meeting (yesterday)!

TEXT ANALYSIS & DATA

Corpus for Season 2 and Season 12 are cleaned, tended to, and ready to be used and analyzed (further). Teddy was able to start the analysis of Season 12, and Maria was able to do initial queries on Season 2. We found some fundamental confirmations of our suspicions. The vocabulary focuses on the appearance and flavor of the bakes. Smell and texture are far less frequently mentioned. We also noticed that the vocabulary truly is limited. “Lovely,” “nice,” delicious” feature heavily in both seasons of judging language. Another discovery was the super-usage of “very” in Season 2 as a sort of calibration word: See the use of “very good.” or “very, very, good” or even “very very very good” to create, um, nuance. The small words we might have glossed over are actually turning out to be the words that corroborate our thesis. There’s also an emerging new theory about the words and short phrases that have become emblematic of GBBS, like “overbaked”, “soggy bottom,” “stodgy” etc. Right now, the suspicion is that they appear in the show at a middle frequency; they stick out just enough to become signature words. These are words we also see most often as part of GBBS memes (which: selected memes to be added to future presentations for our audience’s pleasure). The next few weeks will be about following our hunches into a more detailed analysis. Perhaps, we’ll also collect some overall statistics describing the vocabulary size and the average length of judging sentences. 

Teddy and Maria haven’t yet been able to combine and compare their findings and run deliberate comparison queries. Achieving this is also on the agenda for the next corpus meeting on April 16th. Additionally, Teddy will ensure that the GitHub repository is ready to be linked to the site by late April.

As we pass the semester’s midpoint, a potential plan to include a third comparison corpus from a different show has been abandoned. We will have enough to do with looking into the details of S2 and S12, and there are many more paths to follow with these corpora than we might have initially predicted.

SITE & GAME

We’ve also decided to take one or two striking findings and render them visually in a way that can be easily distinguished from Voyant’s renderings and is more consistent with the style of our website. RC will spearhead this effort once we turn the necessary info over to her. Until then, RC continues to work with Nuraly to find a way to integrate plug-ins so the Bingo game will work. After discovering that CUNY’s version won’t suffice, RC has upgraded to a non-CUNY business version (a cheaper alternative is also still in consideration). Initial trials of plug-ins were promising—more tbd. 

RC will work on making a logo for Ob&Up and continue to develop the site. Maria will finalize the broader aspects of site content with RC (final tab-names, i.e., what will be included and what won’t be included) around by mid-April.

Nuraly will stay in contact with RC and continue to develop and test the Bingo game. Once Teddy and Maria have finished the text analysis, Nuraly will be able to populate the game squares with verbal content. He will also keep an eye on social media activity around GBBS/GBBO and share the finished Bingo game with the GBBS Subreddit.

 

OVERALL

Our various life circumstances and work obligations sometimes pose problems for communication and collaboration. Ideally, Teddy and Maria would have found more time to collaborate directly on text analysis, but since their schedules didn’t align, getting on the same page has been difficult. There also are recurring questions about when and how documents get shared and okayed by the group. Clarifying workflow, submission rhythms, and check-in modes might make sense for our first meeting post-break, especially as we head into the final frenzy.

But, overall, we’ve (mostly) met our milestones and are gearing up for a strong finish post-break.

 

Week 10 DirectHERS Group Update

The DirectHERS team has adjusted our member roles, project scope, and work distribution since mid-February. As a result, we reached several milestones, including completing the basic TEI/XML training, making a custom-designed XML glossary, and five pilot director XML files for testing the search functionality. But our scope is no longer limited to Japanese women directors. Instead, we create a list of non-western groundbreaking directors based on our competencies and backgrounds. We also decided to use more Hypothesis annotations in our encoding as “hyperlink pointers,” which was not our aim in the initial plan. We also need to work more on the glossary, primarily focusing on our customized tags and their definitions. The initial goal for the delivery is that we have 15 directors. The good news is we probably will have more than that number. From the project manager’s perspective, management apps like Asana are not working well for us. As a small team, emails, zoom meetings, and slack are efficient enough.

Gemma wrote the following update for the web dev team:

Dev Team is working on the website, which is starting to take a more defined shape; the idea is to display a selection of the directors’ bios and photos and find a home for some of the high-level information related to the project.
The tag glossary, or dictionary, is also in the process of being added to the website; however, as this is a live document, the final design is still in progress.
Zico is exploring one option for the search engine with Tableau, which will allow filtering and comparing features; nevertheless, other options might still be viable should Tableau fail to meet the team’s expectations. (Other options: Javascript and Ajax; basic search within GitHub pages)
For the two upcoming weeks, Gemma will have one-on-one meetings with all the other team members to go over the directors’ bios assigned to them, their XML contents, and their personal bios which will be crafted in an ad-hoc manner.
The final product is now composed of six main sections, which can be divided as follows: About, the Team, the Project, Research/Inspiration, Search, and Contact.

The research team will meet next Tuesday to finalize one pilot XML file and update the tag glossary. Miaoling will update the DTD file accordingly to validate all XML structures produced by the research team. During the break, the research team will also spend some time writing lab notes, video plans, personal reflections, about page info, and any research-related documents that will be later published on the website. After April 18, the research team will move its focus to writing and stop producing more XML files.

Maria and Miaoling will start drafting tweets and outreach emails next Tuesday and during the break.

The design keeps changing and evolving as more and more contents progressively become available; it’s undeniable that there is now general excitement about the final presentation and the potential of this project!!